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Docket No.: 88362/107 

IN THE UNITED STATES PATENT AND TRADEMARK 

OFFICE 

In re plant patent application of 
Danuta Ewa Irena MOSSAKOWSKA et al 
Serial No. To be assigned 
Filed: Concurrently herewith 

For: COMPLEMENT RECEPTOR TYPE 1 (CRl)-LIKE SEQUENCES 

PRELIMINARY AMENDMENT 

Assistant Commissioner of Patents 
Washington, D.C. 20231 

Sir: 

Preliminary to examination please amend the above-identified application as 

follows: 

IN THE CLAIMS 

Please cancel claims 1-27 without prejudice or disclaimer. Please add the following 

claims: 

28. A soluble polypeptide comprising in sequence one to four short consensus 
repeats (SCR) selected from SCR 1, 2, 3 and 4 of long homologous repeat A (LHR-A) as 
the only structurally and functionally intact SCR domains of CR1 and including at least 
SCR3, wherein at least one of the native amino acids are substituted, wherein the 
substitutions are selected from the group consisting of Val at position 4, Asp at position 19, 
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Ser at position 53, Lys at position 57, Ala at position 74, Asp at position 79, Arg at 
position 84, Pro at position 91, Asn at position 109, Lys at position 116, Val at position 
119, Ala at position 132, Thr at position 137, He at position 139, Ser at position 140, Tyr 
at position 143, His at position 153, Leu at position 156, Arg at position 159, Lys at 
position 161, Lys at position 177, Gly at position 230, Ser at position 235, and His at 
position 236. 

29. The polypeptide according to claim 28 that comprises SCR 1, 2, 3 and 4 of 
LHR-A or SCR 1, 2 and 3 of LHR-A as the only structurally and functionally intact SCR 
domains of CR1. 

30. The polypeptide according to claim 28, wherein the polypeptide further 
comprises interdomain sequences of CR1, wherein at least one of the native amino acids 
are substituted, wherein the substitutions are selected from the group consisting of Lys at 
position 59 and He at position 124. 

3 1 . The polypeptide according to claim 28, wherein the polypeptide has formula 

(D: 

NH 2 -V 1 -SCR1-W 1 -SCR2-X , -SCR3-Y 1 -0H (I), wherein 

SCR1 comprises residues 2-58 of mature CR1, SCR2 comprises residues 63-120 of mature 
CR1, SCR3 comprises residues 125-191 of mature CR1, and V 1 , W 1 , X 1 and Y 1 represent 
bonds or short linking sequences of amino acids, and wherein there is at least one 
substitution present in on of the SCRs. 

32. The polypeptide according to claim 31, wherein V 1 , W 1 , X 1 and Y 1 
polypeptide comprises interdomain sequences of CR1, wherein at least one of the native 
amino acids are substituted, wherein the substitutions are selected from the group 
consisting of Lys at position 59 and He at position 124. 
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33. A polypeptide according to claim 31 wherein W\ X 1 and Y 1 comprise 
residues 59-62, 121-124 and 192-196 of mature CR1, and V 1 represents residue 1 of 
mature CR1. 

34. A polypeptide according to claim 28, wherein the polypeptide has formula 

(H): 

NH 2 -V 2 -SCRl-W 2 -SCR2-X 2 -SCR3-Y 2 -SCR4-Z 2 OH (II), wherein 

SCR1 comprises residues 2-58 of mature CR1, SCR2 comprises residues 63-120 of mature 
CR1, SCR3 comprises residues 125-191 of mature CR1, and SCR4 comprises residues 
197-252 of mature CR1, and V 2 , W 2 , X 2 ,Y 2 and Z 2 represent bonds or short linking 
sequences of amino acids. 

35. The polypeptide according to claim 34, wherein V 2 , W 2 , X 2 and Y 2 
polypeptide comprises interdomain sequences of CR1, wherein at least one of the native 
amino acids are substituted, wherein the substitutions are selected from the group 
consisting of Lys at position 59 and lie at position 124. 

36. The polypeptide according to claim 34 wherein W 2 , X 2 , Y 2 and Z 2 represent 
residues 59-62, 121-124, 192-196, and residues 253 of mature CR1, and V 2 represents 
residue 1 of mature CR1. 

37. The polypeptide according to claim 28, wherein the polypeptide has formula 

(HI): 

NH2-X 3 -SCR3-Y 3 -OH (III), wherein 
SCR3 comprises residues 125-191 of mature CR1, wherein at least one of the native amino 
acids are substituted, wherein the substitutions are selected from the group consisting of 
Ala at position 132, Thr at position 137, He at position 139, Ser at position 140, Tyr at 
position 143, His at position 153, Leu at position 156, Arg at position 159, Lys at position 
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161, Lys at position 177, and X 3 and Y 3 represent bonds or short linking sequences of 
amino acids. 

38. The polypeptide according to claim 37 wherein X 3 represents amino acids 
122-124 of mature CR1 and Y 4 represents amino acids 192-196 of mature CR1. 

39. The polypeptide according to claim 28, wherein the polypeptide has formula 

(IV): 

NH 2 -X 4 -SCR3-Y 4 -SCR4-Z 4 -OH (IV), wherein 

SCR3 comprises residues 125-191 of mature CR1, and SCR4 comprises residues 197-252 
of mature CR1, wherein at least one of the native amino acids are substituted, wherein the 
substitutions are selected from the group consisting of Ala at position 132, Thr at position 
137, lie at position 139, Ser at position 140, Tyr at position 143, His at position 153, Leu 
at position 156, Arg at position 159, Lys at position 161, Lys at position 177, Gly at 
position 230, Ser at position 235, His at position 236, and X 4 , Y 4 and Z 4 represent bonds or 
short linking sequences of amino acids. 

40. The polypeptide according to claim 39 wherein X 4 comprises amino acids 
122-124 of mature CR1, Y 4 comprises amino acids 192-196 and Z 4 comprises amino acid 
253 of mature CR1. 

41. The polypeptide according to claim 39, wherein X 4 comprises interdomain 
sequences of CR1, wherein at least one of the native amino acids are substituted, and 
wherein the substitution is He at position 124. 

42. A soluble derivative of a soluble polypeptide that comprises in sequence one 
to four short consensus repeats (SCR) selected from SCR 1, 2, 3 and 4 of long homologous 
repeat A (LHR-A) as the only structurally and functionally intact SCR domains of CR1 and 
including at least SCR3, wherein at least one of the native amino acids are substituted, 
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wherein the substitutions are selected from the group consisting of Val at position 4, Asp at 
position 19, Ser at position 53, Lys at position 57, Ala at position 74, Asp at position 79, 
Arg at position 84, Pro at position 91, Asn at position 109, Lys at position 116, Val at 
position 119, Ala at position 132, Thr at position 137, He at position 139, Ser at position 
140, Tyr at position 143, His at position 153, Leu at position 156, Arg at position 159, Lys 
at position 161, Lys at position 177, Gly at position 230, Ser at position 235, His at 
position 236, 

wherein the derivative comprises at least two heterologous membrane binding 
elements with low membrane affinity covalently associated with the polypeptide, wherein 
the elements are capable of interacting independently and with thermodynamic additivity 
with components of cellular membranes exposed to extracellular fluids. 

43. The derivative according to claim 42, comprising two to eight membrane 
binding elements selected from the group consisting of fatty acid derivatives, ligands of 
integral membrane proteins, sequences derived from the complementarity-determining 
region of monoclonal antibodies raised against epitopes of membrane proteins, and 
membrane binding sequences identified through screening of random chemical libraries. 

44. The derivative according to claim 42 having the following structure: 

[P]-{L-[W]}n-X 

in which: 

P is the soluble polypeptide, 

each L is independently a flexible linker group, 

each W is independently a peptide membrane binding element, 

n is an integer of 1 or more, and 

X is a peptidic or non-peptidic membrane-binding entity which may be 
covalently linked to any W. 



45. A polypeptide derivative having a sequence selected from the group 
consisting of which is SEQ ID NO. 34, SEQ ID NO. 49 and SEQ ID NO. 51. 
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46. A DNA polymer selected from SEQ ID NOS consisting of: 1, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, and 30. 

47. The DNA polymer of claim 46, wherein the polymer is in a replicable 
expression vector. 

48. The DNA polymer of claim 47, wherein the replicable expression vector is 
in a transformed host cell. 

49. A process for preparing a soluble derivative of a soluble polypeptide that 
comprises in sequence one to four short consensus repeats (SCR) selected from SCR 1,2, 
3 and 4 of long homologous repeat A (LHR-A) as the only structurally and functionally 
intact SCR domains of CR1 and including at least SCR3, wherein at least one of the native 
amino acids are substituted, wherein the substitutions are selected from the group 
consisting of Val at position 4, Asp at position 19, Ser at position 53, Lys at position 57, 
Ala at position 74, Asp at position 79, Arg at position 84, Pro at position 91, Asn at 
position 109, Lys at position 116, Val at position 119, Ala at position 132, Thr at position 
137, He at position 139, Ser at position 140, Tyr at position 143, His at position 153, Leu 
at position 156, Arg at position 159, Lys at position 161, Lys at position 177, Gly at 
position 230, Ser at position 235, His at position 236, comprising 

expressing DNA encoding the polypeptide portion of said derivative in a 
recombinant host cell and recovering the product and thereafter post translationally 
modifying the polypeptide to chemically introduce membrane binding elements. 

50. A pharmaceutical composition comprising (A) a therapeutically effective 
amount of a soluble polypeptide comprising in sequence one to four short consensus repeats 
(SCR) selected from SCR 1, 2, 3 and 4 of long homologous repeat A (LHR-A) as the only 
structurally and functionally intact SCR domains of CR1 and including at least SCR3, 
wherein at least one of the native amino acids are substituted, wherein the substitutions are 
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selected from the group consisting of Val at position 4, Asp at position 19, Ser at position 
53, Lys at position 57, Ala at position 74, Asp at position 79, Arg at position 84, Pro at 
position 91, Asn at position 109, Lys at position 116, Val at position 119, Ala at position 
132, Thr at position 137, He at position 139, Ser at position 140, Tyr at position 143, His 
at position 153, Leu at position 156, Arg at position 159, Lys at position 161, Lys at 
position 177, Gly at position 230, Ser at position 235, His at position 236, and (B) 
a pharmaceutically acceptable carrier or excipient. 

51 . A method of treating a disease or disorder associated with inflammation or 
inappropriate complement activation comprising 

aclministering to a subject a pharmaceutical composition comprising (A) a 
therapeutically effective amount of a soluble polypeptide comprising in sequence one to 
four short consensus repeats (SCR) selected from SCR 1, 2, 3 and 4 of long homologous 
repeat A (LHR-A) as the only structurally and functionally intact SCR domains of CR1 and 
including at least SCR3, wherein at least one of the native amino acids are substituted, 
wherein the substitutions are selected from the group consisting of Val at position 4, Asp at 
position 19, Ser at position 53, Lys at position 57, Ala at position 74, Asp at position 79, 
Arg at position 84, Pro at position 91, Asn at position 109, Lys at position 116, Val at 
position 119, Ala at position 132, Thr at position 137, lie at position 139, Ser at position 
140, Tyr at position 143, His at position 153, Leu at position 156, Arg at position 159, Lys 
at position 161, Lys at position 177, Gly at position 230, Ser at position 235, His at 
position 236, and (B) a pharmaceutically acceptable carrier or excipient. 
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REMARKS 



Applicants have canceled claims 1-27 without prejudice or disclaimer to the subject 
matter recited therein, and expressly reserve all rights to such subject matter. The 
Examiner is respectfully requested to enter claims 28-51 prior to examination. Claims 28- 
51 are presented in order to remove multiply dependent claims and correct informalities. 

Support for the claims can be found throughout the application. See, for example, 
pages 5-11. Applicants await a first office action on the merits. 



FOLEY & LARDNER 
3000 K Street, N.W. 
Suite 500 

Washington, D.C. 20007-5109 
Tel: (202) 672-5300 
Fax: (202) 672-5399 



Respectfully submitted, 



September 2. 1999 




/John P. Isacson 
Reg. No. 33,715 
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COMPLEMENT RECEPTOR TYPE 1 (CR1>-LIKE SEQUENCES 

This invention relates to novel polypeptides and their derivatives which act as 
inhibitors or regulators of complement activation and are of use in the therapy of diseases 
involving complement activation such as various inflammatory and immune disorders. 

Constituting about 10% of the globulins in normal serum, the complement system 
is composed of many different proteins that are important in the immune system's 
response to foreign antigens. The complement system becomes activated when its 
primary components are cleaved and the products alone or with other proteins, activate 
additional complement proteins resulting in a proteolytic cascade. Activation of the 
complement system leads to a variety of responses including increased vascular 
permeability, chemotaxis of phagocytic cells, activation of inflammatory cells, 
opsonization of foreign particles, direct killing of cells and tissue damage. Activation of 
the complement system may be triggered by antigen-antibody complexes (the classical 
pathway) or, for example, by iipopoiysaccharides present in ceil walls of pathogenic 
bacteria (the alternative pathway). 

Complement activation (CA) is known to occur in a wide variety of acute 
inflammatory processes particularly those associated with ischaemia and reperfusion 
injury (Rossen et aL, 1985 Circ. Res., 57, 1 19,; Morgan B.P., 1990 The biological effects 
of complement activation. In 'Complement, Clinical Aspects and Relevance to Disease'. 
Academic Press. London.) 

It is also generally accepted that at least some of the components of the classical 
complement cascade can be detected by immunohistochemical methods in close 
association with senile plaques in the brains of sufferers from Alzheimer's disease 
(Eikelenboom et al., 1994, Neuroscience, 59, 561-568) and that complement activation 
plays a role in the inflammatory component of this condition.. 

Complement receptor type 1 (CR1) has been shown to be present on the 
membranes of erythrocytes, monocytes/macrophages, granulocytes, B ceils, some T ceils, 
splenic follicular dendritic ceils, and glomerular podocytes. CR1 binds to the 
complement components C3b and C4b and has also been referred to as the C3b/C4b 
receptor. The structural organisation and primary sequence of one allotype of CR1 is 
known (Kiickstein et aL, 1987, J. Exp. Med. 165:1095-1 1 12, Klickstein et aL 1988, J. 
Exp. Med. 168:1699-1717; Hourcade et aU 988, J* Exp. Med. 168:1255-1270, WO 
89/09220, WO 91/05047). It is composed of 30 short consensus repeats (SCRs) that each 
contain around 60-70 amino acids. In each SCR ? around 29 of the average 65 amino 
acids are conserved. Each SCR has been proposed to form a three dimensional triple loop 
structure chrough disulphide linkages with the third and first and the fourth and second 
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half-cystines in disulphide bonds. CRl is farther arranged as 4 long homologous repeats 
(LHRs) of 7 SCRs each. Following a leader sequence, the CRl molecule consists of the 
N-terminai LHR-A, the next two repeats, LHR-B and LHR-C, and the most C-terminal 
LHR-D followed by 2 additional SCRs, a 25 residue putative transmembrane region and a 
43 residue cytoplasmic tail. 

Based on the mature CRl molecule having a predicted N-terminal glutamine 
residue, hereinafter designated as residue 1, the first four SCR domains of LHR-A are 
defined herein as consisting of residues 2-58, 63-120, 125-191 and 197-252, respectively, 
of mature CRl. 

Hourcade et aL, 1988, J. Exp. Med. 168:1255-1270 observed an alternative 
polyadenyiation site in the human CRl transcriptional unit that was predicted to produce 
a secreted form of CRL The mRNA encoded by this truncated sequence comprises the 
first 8.5 SCRs of CRl, and encodes a protein of about 80 kDa which was proposed to 
include the C4b binding domain. When a cDNA corresponding to this truncated 
sequence was transfected into COS ceils and expressed, it demonstrated the expected C4b 
binding activity but did not bind to C3b (Krych et aL, 1989, FASEB J. 3:A368; Krych et 
aL Proc. Nat. Acad. Sci. 1991, 88, 4353-7). Krych et aL, also observed a mRNA similar 
to the predicted one in several human cell lines and postulated that such a truncated 
soluble form of CRl with C4b binding activity may be synthesised in humans. 

In addition, Makrides etal (1992, J. Biol. Chem. 262 (34) 24754-61) have 
expressed SCR 1 + 2 and 1+2 + 3+4 of LHR-A as membrane-attached proteins in CHO 
ceils. 

Several soluble fragments of CRl have also been generated via recombinant DNA 
procedures by eliminating the transmembrane region from the DNAs being expressed 
(WO 89/09220, WO 91/05047). The soluble CRl fragments were functionally active, 
bound C3b and/or C4b and demonstrated Factor I cofactor activity depending upon the 
regions they contained. Such constructs inhibited in vitro complement-related functions 
such as neutrophil oxidative burst, complement mediated hemolysis, and C3a and C5a 
production. A particular soluble construct, sCRl/pBSCRlc, also demonstrated in vivo 
activity in a reversed passive Arthus reacdon (WO 89/09220, WO 9 1/05047; Yeh et aL 
1991, J. Immunol. 146:250), suppressed post-ischemic myocardial inflammation and 
necrosis (WO 89/09220, WO 91/05047; Weisman et aL Science, 1990, 249:146-1511; 
Dupe, R. et aL Thrombosis & Haemostasis (1991) 65(6) 695.) and extended survival rates 
following transplantation (Pruitt & Bollinger, 1991, J. Surg. Res 50:350; Pruitt et aL f 
1991 Transplantation 52; 868). Furthermore, co-formulation of sCRl/pBSCRlc with p- 
anisoyiated human piasminogen-streptokinase-activator complex (APSAC) resulted in 
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similar anti-haemolytic activity as sCRl alone, indicating that the combination of the 
complement inhibitor sCRl with a thrombolytic agent was feasible (WO 91/05047). 

In a model of antibody-mediated demyeiinating experimental allergic 
encephalomyelitis (ADEAE), systemic inhibition of CA using sCRl over 6 days, 
produced improvements in clinical score and blocked CNS inflammation, demyelination 
and deposition of complement components (Piddlesden et aL, 1994, J. Immunol. 152, 
5477). ADEAE can be regarded as a model of acute relapse in multiple sclerosis (MS) 
and these striking results suggested possible applications for sCRl in MS therapy despite 
the high molecular weight (245 kilodaitons) of this agent. 

In a rat model of traumatic brain injury, complement inhibitor sCRl (also known 
as TP10 or BRL55730) was shown to reduce myeloperoxidase activity (an indicator of 
neutrophil accumulation) following traumatic injury (Kaczorowska et al, 1995, J. 
Cerebral Blood Flow and Metabolism, 15, 860-864). This is suggested as demonstrating 
that complement activation is involved in the local inflammatory response. 

Soluble polypeptides corresponding to part of CR1 having functional complement 
inhibitory, including anti-haemoiyxic activity, have been described in WO94/00571 
'comprising, in sequence, one to four short consensus repeats (SCR) selected from SCR 1, 
2, 3 and 4 of long homologous repeat A (LHR-A) as the only structurally and functionally 
intact SCR domains of CR1 and including at least SCR3. 

Pseudo genes are usually defined as DNA sequences which possess a high degree 
of homology to genes with identified function but which are not expressed. The origins 
of the lack of transcription and translation vary but are commonly the presence of 
accumulated mutations which inactivate transcriptional initiation sites, disrupt RNA 
splicina or introduce frame-shift mutations and premature termination codons. 
Pseudogenes are sometimes regarded as genetic relics which have been isolated within 
the genome through a primary loss of expressabiiity and which have subsequently 
mutated randomly in situ to highly aberrant forms. There is a frequent presumption that 
pseudogene sequences, if expressable at all, will not be functionally active because of an 
accumulation of deleterious in-frame mutations. However, studies of immune system 
genetics suggest that pseudogenes may act as a source of diversity in somatic mutation 
processes and that non-expressed sequences may recombine with normally expressed 
genes to create functional variants with a conserved framework. This phenomenon has 
been documented in immunoglobulin VL and VH genes and elsewhere (W.T.McCormack 
et aL, Genes Dev. 4, 548-58, 1990). 

The creation of pseudogenes through reverse transcription followed by DNA 
integration is also known. In such cases, the integrated sequences (which can in principle 
originate from organisms other than the host) lack introns and may be sited in 
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chromosomal locations distant from the expressed gene to which they are homologous 
because integration into the genome can occur at random sites. Such genes are known as 
processed pseudogenes. The presence of pseudogenes in chromosomal clusters with 
homologous expressed genes argues against them being processed pseudogenes because 
of the improbability of a random integration process giving rise to close physical 
clustering in a large genome. 

The existence of a gene homologous to that for complement receptor type 1 (CR1) 
was first reported by Hourcade et al. (J.BioLChem, 275, 974-80, 1990), who found it 
associated with a gene cluster on chromosome Iq 32 termed the Regulators of 
Complement Activation (RCA) cluster. The latter comprises the CR1 gene itself and 
those encoding decay accelerating factor (DAF), membrane cofactor protein (MCP), 
Factor H, complement receptor type 2 (CR2) and C4 binding protein. This 'CR1-Iike f 
gene was predicted to encode a protein containing seven SCR regions corresponding (by 
closest homology) to SCRs 1-6 and 9 of LHR-A (1-6) or LHR-B (9) of CR1 itself. The 
overall homology with the above regions of CR1 at the predicted amino acid level is 91% 
and the sequence divergence is greatest in the first three SCRs. 

ALIGNMENT OF CR1-LIKE GENE (Cripse) WITH THE HUMAN CR1 SEQUENCE 
(Crl.Pe, LHRA region). 

(NB. CR1 numbering includes the signal sequence) 

30.7% idencicy in 446 aa overlap 

10 20 30 

Cripse QOTVPEWLPFARJPTNLTDDFSFP XGTYLNY 

lihllMilimillhiillllMMI 
en . ?e vgppapglpfccggsllavvvllalpvawgq 

20 30 40 50 SO 70 

40 50 50 70 30 90 

Cripse ECRPGYSGRPFS XXCLKNSVWTSAXDKCKRKSCHNPPDPWGMAKVXKDIQFRSQXKYSC 

llllNMMlMillMlllhllhhllilinMilllhllihll! Ililll! 

Crl . Pe E CRPGYSGRP FS 1 1 CLKNS VWTGAKDRCRRKS CRNP P D P VNGMVHVX KG I Q FGSQ 1 KYS C 
30 90 LOQ L10 120 130 

LOO 110 120 130 140 ISO 

Cripse PKGYRLXGSSSATCIISGNTVXWDmCTPVCDRrrCGLPPTIANGDFTSISREYFHYGSVV 

HI! II I II I II Ml II I HI Nihil HI II i 1 1 ! 1 1 ! = i M I : I = = 1 1 1 1 Mill 

• - Crl.Pe TXGYRLXGSSSATCIISGDTVIWDNETPICDRIPCGLPPTITNGDFISTNRENFHYGSVV 
140 ISO ISO 170 130 190 

ISO 170 13Q 190 200 210 

Crlnse TYKGMLGS RGX2CVFEL VGEP S I YCTS XDDQVG IWSG P APQC X I PNKCT? PNVENG X LVS D 

IhM H hlNMlMllMllhllMliMllimillllllinilllllM! 

Crl . Pe TYRCNPGSGGRKVFHLVGEPSIYCTSNDOQVGIWSGPAPQCXXPNKCTPPNVENGXLVSD 
200 210 220 230 240 250 

220 230 240 2S0 2S0 270 

Cripse NRS LFS I*NE WEFRCQ PGFGMKGPSHVKCQALNKWE P SLPS CS RVCQP ? P DVLHAERTQ R 

MIIMIlMMMIllil I[!h:|[!!IMIMMMIIIII!!!lilMt[ll)M 

Crl . ?e MRS LFS LNEWEFRCQ PG FVMKG? RRVKCQALNKWE? EL PS CS RVCQ P ? P DVLHAERTQ R 
260 270 230 290 300 310 
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230 290 300 310 320 330 

Crlose DKDNFS PGQEVFYSCEPGYDLRGSTYLHCTPQGOWS PAAPRCEVKSCDD FI*GQLP?TGHVL 

li!illl!!i!ll!!ill!ll[h: ; '1 1 1 1 1 i 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 = | i I IhM 

Crl . Pe D KDNFS PGQ E VF YS CEPGYD LRGAASMRCTPQGD WS ? AAPTCE VKSCD D FMGQ LLNGR VL 
320 330 340 350 350 370 

340 350 3SQ 370 330 390 

Crlose FPLNLQLGAKVD FVCD EG FQ L KGSSAS YCVLAGMESL WNSS VP VCZRESCXTP PVPVNGM 

IhllMillllNlMIIIUlllMIIIMIIIIIiMMMM: I ! 

Crl . Pe ??VNLQLGAKVT)FVCDEGFQLXGSSASYCV1^ 

380 390 400 410 420 430 

400 410 420 430 

Crlose - - VHVITDIHVGSRINYSC TTGKRLIGHSSAECIL-SGNTAKWSMKPPICQ 

: - : = h :||=| :|: 111=1= 1= : II :| | 

Crl .Pe HTGKPLEVF? FGKAVNYTCD PKPDRGTS FDLIGESTIRCTSDPQGNGVWSS PAP RCG 

440 4S0 460 470 480 490 

The CRl-Iike gene also encodes a signal peptide but not a transmembrane or 
cytoplasmic region and contains an intron-exon structure similar to the CR1 gene. There 
is currently no evidence that the CRl-like gene is expressed and neither mRNA 
transcripts nor a soluble protein have been isolated. The origin of the CRl-like sequence 
may lie in a gene duplication event in an ancestral CR1 gene which was followed by 
divergence and transcriptional inactivation of the CRl-like gene. It appears probable that 
the CRl-like gene is currently a pseudogene although not of the processed type. 

It has now been found that replacement of codons in DNA encoding the first three 
SCRs of LHR-A of CR1 with others encoding the predicted aminoacids in the CRl-like 
sequence can give rise to chimeric genes which can be expressed to give active 
complement inhibitors with functional complement inhibitory, including anti-haemo lytic, 
activity. 

According to the present invention there is provided a soluble polypeptide 
comprising, in sequence, one to four short consensus repeats (SCR) selected from SCR 1, 
2, 3 and 4 of long homologous repeat A (LHR-A) as the only structurally and functionally 
intact SCR domains of CR1 and including at least SCR3, in which one or more of the 
native amino acids are substituted with the following: 

Val 4, Asp 19, Ser 53, Lys 57, Ala 74, Asp 79, Arg 84, Pro 91, Asn 109, Lys 116, Val 
1 19, Ala 132, Thr 137, He 139, Ser 140, Tyr 143, His 153, Leu 156, Arg 159, Lys 161, 
Lys 177, Gly 230, Ser 235, His 236. 

(Numbering is from glutamine as residue 1 of mature CRL The amino-acid indicated is 
that which replaces the CRI residue at the position specified.) 

In preferred aspects, the polypeptide comprises, in sequence, SCR 1, 2, 3 and 4 of 
LHR-A or SCR 1, 2 and 3 of LHR-A as the only structurally and functionally intact SCR 
domains of CRI with the modification(s) described above. 
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It is to be understood that additional variations in the amino acid sequence of the 
polypeptide of the invention by way of addition, deletion or conservative substitution of 
residues, including allelic variations, in which the biological activity of the polypeptide is 
retained, are encompassed by the invention. Conservative substitution is understood to 
mean the retention of the charge and/or size characteristics of the amino acid side chain, 
for example arginine replaced by lysine or glutamine . 

In one aspect, the polypeptide of the invention may be represented symbolically 

as follows: 

NH 2 -V l -SCR1-Wi-SCR2-X l -SCR3-Yl-0H (I) 
in which SCR1 represents residues 2-58 of mature CR1, SCR2 represents residues 63-120 
of mature CR1, SCR3 represents residues 125-191 of mature CR1, and containing at least 
one of the substitutions as aforesaid and V*. W*. X 1 and Y 1 represent bonds or short 
linking sequences of amino acids, preferably 1 to 5 residues in length and which are 
preferably derived from native interdomain sequences in CR1. 

The native interdomain sequences in CR1 may also be substituted with the 
corresponding predicted aminoacids in the CRl-like sequence, namely Lys59 and/or 
Tie 124. (Numbering is from glutamine as residue 1 of mature CR1. The amino-acid 
indicated is that which replaces the CR1 residue at the position specified.) 

In a preferred embodiment , the SCR3 domain of formula (I) is substituted with 
all ten residues found in the corresponding pseudogene sequence, namely (in single letter 
code): 

A132, T137, 1139, S140, Y143, H153, L156, R159, K161, K177 (Sequence Group 1) 
and the remaining domains have the sequence of mature CRI. 

In a further preferred embodiment of formula (I), W 1 , X 1 and Y 1 represent 
residues 59-62, 121-124 and 192-196, respectively, of mature CRI, optionally substituted 
as aforesaid and V* represents residue i of mature CRI optionally linked via its N- 
terminus to methionine. 

In another aspect the polypeptide of the invention may be represented 

symbolically as follows: 

NH 2 -V 2 -SCRl-W 2 -SCR2-X 2 -SCR3-Y 2 -SCR4-Z 2 OH (II) 

' in which SCR1, SCR2 and SCR3 are as hereinbefore defined, SCR4 represents residues 

197-252 of mature CRI and containing at least one of the substitutions as aforesaid, and 

y2 ) w 2 , x 2 , Y 2 and Z 2 represent bonds or short linking sequences of amino acids, 

preferably 1 to 5 residues in length and which are preferably derived from native 

interdomain sequences in CRI, optionally substituted as aforesaid. 
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In a preferred embodiment of formula (II)), the SCR3 region is substituted with 
the aforementioned Sequence Group I residues and the remaining domainshave the 
sequence of mature CRI. 

In further preferred embodiments of formula (II), X 2 , Y 2 and Z 2 represent 
residues 59-62 ? 121-124, 192-196, and residues 253 respectively, of mature CRL 
optionally substituted as aforesaid, and V 2 represents residue 1 of mature CRI optionally 
linked via its N-terminus to methionine. 

In one particular embodiment of formula (II) arginine 235 is replaced by 
histidine. 

In the preferred embodiment of formula (II), residue 235 is arginine. 

In one further aspect, the polypeptide of the invention may be represented 
symbolically as follows: 

NH 2 -X 3 -SCR3-Y3-OH (III) 
in which SCR3 is as hereinbefore defined, containing at least one of the substitutions as 
aforesaid, and in a preferred embodiment, all those of Sequence Group 1, and X 3 and Y^ 
represent bonds or short linking sequences of amino acids, preferably 1 to 5 residues in 
length and which are preferably derived from native interdomain sequences in CRI, 
optionally substituted as aforesaid. 

In a further preferred embodiment of formula (III) X 3 represents amino acids 122- 
124 of mature CRI, optionally substituted as aforesaid, optionally linked to methionine at 
its N-terminus and Y 4 represents amino acids 1 92- 1 96 of mature CRL 

In another further aspect, the polypeptide of the invention may be represented 
symbolically as follows: 

NH2-X 4 -SCR3-Y 4 -SCR4-Z 4 -OH (IV) 
in which SCR3 and SCR4 are as hereinbefore defined containing at least one of the 
substitutions as aforesaid and X 4 , Y 4 and Z 4 represent bonds or short linking sequences 
of amino acids, preferably 1 to 5 residues in length and which are preferably derived from 
native interdomain sequences in CRI, optionally substituted as aforesaid. • 

In a preferred embodiment of formula (IV), the SCR3 region is substituted with 
the residues of Sequence Group I and the remaining domainshave the sequence of mature 
CRI. 

In a further preferred embodiment of formula (IV) X 4 represents amino acids 122- 
124 of mature CRI, optionally substituted as aforesaid, optionally linked to methionine at 
its N-terminus and Y 4 and Z 4 represent amino acids 192-196 and 253 respectively of 
mature CRL 
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The soluble polypeptides of the invention lack the membrane binding capability 
of the full length CR1 proteins which properties may be advantageous to the therapeutic 
activity. 

The main classes of interaction of proteins with membranes can be summarised as 
follows: 

1. Direct and specific interactions with phospholipid head groups or with other 
hydrophilic regions of complex lipids or indirectly with proteins already inserted in the 
membrane. The latter may include all the types of intrinsic membrane protein noted 
below and such interactions are usually with extracellular domains or sequence loops of 
the membrane proteins; 

2. Through anchoring by a single hydrophobic transmembrane helical region near 
the terminus of the protein. These regions commonly present a hydrophobic face around 
the entire circumference of the helix cylinder and transfer of this structure to the 
hydrophilic environment of bulk water is energetically unfavourable. 

3. Further anchoring is often provided by a short sequence of generally cationic 
aminoacids at the cytoplasmic side of the membrane, C -terminal to the transmembrane 
Helix. The membrane-binding properties of CR1 are provided by features 2 and 3. 

4. Through the use of multiple (normally 2-12 and commonly 4,7 and 10) 
transmembrane regions which are usually predicted to be helical or near-helical. 
Although these regions are normally hydrophobic overall, they frequently show some 
amphipathic behaviour - an outer hydrophobic face and an inner more hydrophilic one 
being identifiable within a helix bundle located in the lipid bilayer; 

5. Through postranslationaliy linked phosphatidyl inositol moieties (GPI- 
anchors). These are generated by a specific biosynthetic pathway which recognises and 
removes a specific stretch of C-terminal aminoacids and"creates a membrane-associating 
diacyi glycerol unit linked via a hydrophilic carbohydrate spacer to the polypeptide; 

6. In a related process, single fatty acid groups such as myristoyl, palmitoyi or 
prenyl may be attached postranslationaliy to one or more sites in a protein (usually at N- 
or C-termini). Again, amino acids (such as the C-terminal CAAX box in Ras proteins) 
may be removed. 

The present invention further provides a soluble derivative of the soluble 
polypeptide of the invention, said derivative comprising two or more heterologous 
membrane binding elements with low membrane affinity covalently associated with the 
polypeptide which elements are capable of interacting independently and with 
thermodynamic additivity with components of cellular membranes exposed to 
extracellular fluids. 
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By 'heterologous 1 is meant that the elements are not found in the native full length 
CR1 protein. 

By 'membrane binding element with low membrane affinity 1 is meant that the 
element has significant affinity for membranes, that is a dissociation constant greater 
thanl uM, preferably 1 uM-lmM. The elements preferably have a size <5kDa. 

The derivative should incorporate sufficient elements with low affinities for 
membrane components to result in a derivative with a high (preferably 0.01 - IOnM 
dissociation constant) affinity for specific membranes. The elements combine so as to 
create an overall high affinity for the particular target membrane but the combination 
lacks such high affinity for other proteins for which single elements may be (low-affinity) 
ligands. 

The elements should be chosen so as to retain useful solubility in pharmaceutial 
formulation media, preferably >100ug/mL Preferably at least one element is hydrophilic. 

The further embodiment of the invention thus promotes localisation of the 
polypeptide of the invention at cellular membranes and thereby provide one or more of 
several biologically significant effects with potential therapeutic advantages including: 

Potency: an increase in effective concentration may result from the reduction in 
the diffusional degrees of freedom. 

Pharmacokinetics and dosing frequency: Interaction of the derivatised 
polypeptide with long-lived cell types or serum proteins would be expected to prolong the 
plasma residence time of the polypeptide and produce a depot effect through deposition 
on cell surfaces. 

Specificity: Many clinically important pathological processes are associated with 
specific ceil types and tissues (for example the vascular endothelium and the recruitment 
thereto of neutrophils bearing the sialyl Lewis x antigen to ELAM-1, see below). Hence 
targeting the modified polypeptide to regions of membrane containing pathology- 
associated membrane markers may improve the therapeutic ratio of the protein targeted. 

It will be appreciated that all associations of heterologous amino acid sequences 
with the polypeptide will need to be assessed for potential immunogenicity, particularly 
where the amino acid sequence is not derived from a human protein. The problem can be 
minimised by using sequences as close as possible to known human ones and through 
computation of secondary structure and antigenicity indices. 

The derivative preferably comprises two to eight, more preferably two to four 
membrane binding elements. 

Membrane binding elements are preferably selected from: fatty acid derivatives 
such as fatty acyl groups; basic amino acid sequences; ligands of known integral 
membrane proteins; sequences derived from the complementarity-determining region of 
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monoclonal antibodies raised against epitopes of membrane proteins; membrane binding 
sequences identified through screening of random chemical libraries. 

The selection of suitable combination of membrane binding elements will be 
guided by the nature of the target cell membrane or components thereof 

Suitable fatty acid derivatives include myristoyl (12 methylene units) which is 
insufficiently large or hydrophobic to permit high affinity binding to membranes. Studies 
with myristoylated peptides (eg R.M.Peitzsch & S.McLaughiin, Biochemistry, 32, 10436- 
10443, 1993)) have shown that they have effective dissociation constants with model 
lipid systems of -10" 4 M and around 10 of the 12 methylene groups are buried in the 
lipid bilayer. Thus, aliphatic acyl groups with between about 8 and 18 methylene units, 
preferably 10-14, are suitable membrane binding elements. Other examples of suitable 
fatty acid derivatives include long-chain (8-18, preferably 10-14 methylene) aliphatic 
amines and thiols, steroid and famesyl derivatives. 

Membrane binding has been found to be associated with limited (single-site) 
modification with fatty acyl groups when combined with a cluster of basic aminoacids in 
the protein sequence which may interact with acidic phospholipid head groups and 
provide the additional energy to target membrane binding. This combination of effects 
has been termed the 'myristoyi-electrostatic switch' (S.McLaughiin and A.Aderem, TIBS, 
20,272-276, 1994; JJF.Hancock et al, Cell, 63, 133-139,1990). Thus, a further example 
of suitable membrane binding elements are basic aminoacid sequences such as those 
found in proteins such as Ras and MARCKS (myristoylated alanine-rich C-kinase 
substrate, PJ. Biackshear, J. Biol. Chem., 268, 1501-1504, 1993) which mediate the 
electrostatic 'switch' through reversible phosphorylation of serine residues within the 
sequence and a concomitant neutralisation of the net positive charge. Such sequences 
include but are not restricted to consecutive sequences of Lysine and Arginine such as 
(Lys)n where n is between 3 and 10, preferably 4 to 7. 

Suitable examples of amino acid sequences comprising basic amino acids include: 

i) DGPKKKKKKSPSKSSG 

ii) GSSKSPSKKKKKKPGD 

iii) SPSNETPKKKKKRFSFKKSG 
(N-terminus on left) 

Sequences i) to iii) are examples of electrostatic switch sequences. 

Examples of amino acid sequences include RGD-containing peptides such as 
GRGDSP which are ligands for the ai&fc integrin of human platelet membranes. Further 
examples of such sequences include those known to be involved in interactions berween 
membrane proteins such as receptors and the major histocompatibility complex. An 
example of such a membrane protein iigand is the sequence GNEQSFRVDLRTLLRYA 
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which has been shown to bind to the major histocompatibility complex class 1 protein 
(MHC-1) with moderate affinity (L.Olsson et al Proc. Natl .Acad.Sci.USA. 91, 9086- 
909, 1994). 

An example of a ligand for an integral membrane protein is the carbohydrate 
ligand Sialyl Lewis* which has been identified as a ligand for the integral membrane 
protein ELAM-1 (MX.Philiips et al Science, 250, 1 130-1 132, 1990 & G.Walz et al 
Ibid, 250, 1132-1135,1990). 

Sequences derived from the complementarity <teteimining regions of monoclonal 
antibodies raised against epitopes within membrane proteins (see, for example, J.W.Smith 
et al f J.BioLChem. 270, 30486-30490, 1995) are also suitable membrane binding 
elements, as are binding sequences from random chemical libraries such as those 
generated in a phage display format and selected by biopanning operations in vitro 
(G.F.Smith and J.K.Scott, Methods in Enzymoiogy, 217H, 228-257,1993) or in vivo 
(RJPasqualini & E.Ruoslahti, Nature, 380, 364-366, 1996). 

Optionally, conditional dissociation from the membrane may be incorporated into 
derivatives of the invention using mechanisms such as pH sensitivity (electrostatic 
switches), regulation through metal ion binding (using endogenous Ca 2 *, Zn 2+ and 
incorporation of ion binding sites in membrane binding elements) and protease cleavage 
(e.g plasminolysis of lysine-rich membrane binding sequences to release and activate 
prourokinase) 

Preferred derivatives of this invention have the following structure: 
[P]-{MW]} n -X 

in which: 

P is the soluble polypeptide, 

each L is independently a flexible linker group, 

each W is independently a peptidic membrane binding element, 

n is an integer of 1 or more and 

X is a peptidic or non-peptidic membrane-binding entity which may be covalently 
linked to any W. 

Peptidic membrane binding elements are preferably located sequentially either at 
the N or C terminus of the soluble polypeptide and are preferably 8 to 20 amino acids 
long. The amino acid sequences are linked to one another and to the soluble peptide by 
linker groups which are preferably selected from hydrophilic and/or flexible aminoacid 
sequences of 4 to 20 aminoacids: linear hydrophilic synthetic polymers; and chemical 
bridging groups. 
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In a further aspect, the invention provides a process for preparing a polypeptide 
according to the invention which process comprises expressing DNA encoding said 
polypeptide in a recombinant host cell and recovering the product 

In particular, the process may comprise the steps of: 

i) preparing a replicable expression vector capable, in a host cell, of expressing a 
DNA polymer comprising a nucleotide sequence that encodes said polypeptide; 

ii) transforming a host ceil with said vector; 

iii) culturing said transformed host ceil under conditions permitting expression of said 
DNA polymer to produce said polypeptide; and 

iv) recovering said polypeptide. 

The DNA polymer comprising a nucleotide sequence that encodes the polypeptide 
also forms pan of the invention. 

The process of the invention may be performed by conventional recombinant 
techniques such as described in Sambrook et aL, Molecular Cloning : A laboratory 
manual 2nd Edition. Cold Spring Harbor Laboratory Press (1989) and DNA Cloning 
vols L II and III (D. M. Glover ecL, IRL Press Ltd). 

The invention also provides a process for preparing the DNA polymer by the 
condensation of appropriate mono-, di- or oligomeric nucleotide units. 

The preparation may be carried out chemically, enzyrnatically, or by a 
combination of the two methods, in vitro or in vivo as appropriate. Thus, the DNA 
polymer may be prepared by the enzymatic ligation of appropriate DNA fragments, by 
conventional methods such as those described by D. M. Roberts et aL, in Biochemistry 
1985,24,5090-5098. 

The DNA fragments may be obtained by digestion of DNA containing the 
required sequences of nucleotides with appropriate restriction enzymes, by chemical 
synthesis, by enzymatic polymerisation, or by a combination of these methods. 

Digestion with restriction enzymes may be performed in an appropriate buffer at a 
temperature of 20°-70°C, generally in a volume of 50p.i or less with 0.1-lOp.g DNA. 

Enzymatic polymerisation of DNA may be carried out in vitro using a DNA 
polymerase such as DNA polymerase I (Klenow fragment) in an appropriate buffer 
containing the nucleoside triphosphates dATP, dCTP, dGTP and dTTP as required at a 
temperature of 10°-37°C, generally in a volume of 50ul or less. 

Enzymatic ligation of DNA fragments may be carried out using a DNA ligase 
such as T4 DNA ligase in an appropriate buffer at a temperature of 4°C to 37°C, 
generally in a volume of 50ul or less. 

The chemical synthesis of the DNA polymer or fragments may be carried out by 
conventional phosphotriester, phosphite or phosphoramidite chemistry, using solid phase 
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techniques such as those described in 'Chemical and Enzymatic Synthesis of Gene 
Fragments - A Laboratory Manual' (ed. KG. Gassen and A. Lang), Veriag Chemie, 
Weinheim (1982), or in other scientific publications, for example MJ.Gait, H.W.D. 
Matthes M Singh, B.S. Sproat and R.C. Titmas, Nucleic Acids Research, 1982, 10. 6243; 
B.S. Sproat and W, Bannwarth, Tetrahedron Letters, 1983, 2fL 5771; M.D. Matteucci 
and M.H. Caruthers, Tetrahedron Letters, 1980, 21, 719; M.D. Matteucci and M.H. 
Caruthers, Journal of the American Chemical Society, 1981, 103, 3185; S.P. Adams et 
al, Journal of the American Chemical Society, 1983, 105, 661; N.D. Sinha, J. Biernat, J. 
McMannus and H. Koester, Nucleic Acids Research, 1984, 12, 4539; and H.W.D. 
Matthes etaL, EMBO Journal, 1984, 3, 801. Preferably an automated DNA synthesiser 
(for example, Applied Biosystems 381 A Synthesiser) is employed. 

The DNA polymer is preferably prepared by Iigating two or more DNA molecules 
which together comprise a DNA sequence encoding the polypeptide. 

The DNA molecules may be obtained by the digestion with suitable restriction 
enzymes of vectors carrying the required coding sequences. 

The precise structure of the DNA molecules and the way in which they are 
"obtained depends upon the structure of the desired product. The design of a suitable 
strategy for the construction of the DNA molecule coding for the polypeptide is a routine 
matter for the skilled worker in the art. 

In particular, consideration may be given to the codon usage of the particular host 
cell. The codons may be optimised for high level expression in E. coli using the 
principles set out in Devereux etaL, (1984) NucL Acid Res., 12, 387. 

The expression of the DNA polymer encoding the polypeptide in a recombinant 
host cell may be carried out by means of a replicable expression vector capable, in the 
host cell, of expressing the DNA polymer. The expression vector is novel and also forms 
part of the invention. 

The replicable expression vector may be prepared in accordance with the 
invention, by cleaving a vector compatible with the host ceil to provide a linear DNA 
segment having an intact replicon, and combining said linear segment with one or more 
DNA molecules which, together with said linear segment, encode the polypeptide, under 
Iigating conditions. 

The ligation of the linear segment and more than one DNA molecule may be 
carried out simultaneously or sequentially as desired. 

Thus, the DNA polymer may be preformed or formed during the construction of 
the vector, as desired. The choice of vector will be determined in part by the host cell, 
which may be prokaryotic, such as E. coli, or eukaryotic, such as mouse C127, mouse 
myeloma, chinese hamster ovary, fungi e.g. filamentous fungi or unicellular 'yeast 1 or an 
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insect cell such as Drosophila. The host cell may also be in a transgenic animal. Suitable 
vectors include piasmids, bacteriophages, cosmids and recombinant viruses derived from, 
for example, bacuioviruses or vaccinia. 

The DNA polymer may be assembled into vectors designed for isolation of stable 
transformed mammalian cell lines expressing the fragment e.g. bovine papillomavirus 
vectors in mouse CI 27 cells, or amplified vectors in Chinese hamster ovary cells (DNA 
Cloning Vol. II D JVL Glover ed. IRL Press 1985; Kaufman, R.J. et aL Molecular and 
Cellular Biology 5, 1750-1759, 1985; Paviakis G.N. andHamer, D.H. Proceedings of 
the National Academy of Sciences (USA) 80, 397-401, 1983; Goeddei, D.V. et 
al, European Patent Application No. 0093619, 1983). 

The preparation of the replicable expression vector may be carried out 
conventionally with appropriate enzymes for restriction, polymerisation and ligation of 
the DNA, by procedures described in, for example, Sambrook et al. f cited above. 
Polymerisation and ligation may be performed as described above for the preparation of 
the DNA polymer. Digestion with restriction enzymes may be performed in an 
appropriate buffer at a temperature of 20°-70 C C, generally in a volume of 50jal or less 

with0.1-10ng DNA. 

The recombinant host cell is prepared, in accordance with the invention, by 
transforming a host ceil with a replicable expression vector of the invention under 
transforming conditions. Suitable transforming conditions are conventional and are 
described in, for example, Sambrook et aL, cited above, or "DNA Cloning" Vol. II, D.M. 
Glover ed., IRL Press Ltd, 1985. 

The choice of transforming conditions is determined by the host cell. Thus, a 
bacterial host such as Rcali, may be treated with a solution of CaCb (Cohen et a/.,Proc. 
Nat Acad. Sci., 1973, 69, 21 10) or with a solution comprising a mixture of RbCl, MnCl2, 
potassium acetate and glycerol, and then with 3-[N-morpholino]-propane-sulphonic acid, 
RbCl and glycerol or by electroporation as for example described by Bio-Rad 
Laboratories, Richmond, California, USA, manufacturers of an electroporator. 
Mammalian cells in culture may be transformed by calcium co-precipitation of the vector 
DNA onto the cells or by using cationic liposomes. 

The invention also extends to a host cell transformed with a replicable expression 

vector of the invention. 

Culturing the transformed host cell under conditions permitting expression of the 
DNA polymer is carried out conventionally, as described in, for example, Sambrook et 
aL, and n DNA Cloning' 1 cited above. Thus, preferably the cell is supplied with nutrient 
and cultured at a temperature below 45°C. 
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The protein product is recovered by conventional methods according to the host 
cell. Thus, where the host ceil is bacterial such as £ coli and the protein is expressed 
intracellularly, it may be iysed physically, chemically or enzymatically and the protein 
product isolated from the resulting lysate. Where the host cell is mammalian, die product 
is usually isolated from the nutrient medium. 

Where the host ceil is bacterial, such as £ coli, the product obtained from the 
culture may require folding for optimum functional activity. This is most likely if the 
protein is expressed as inclusion bodies. There are a number of aspects of the isolation 
and folding process that are regarded as important. In particular, the polypeptide is 
preferably partially purified before folding, in order to minimise formation of aggregates 
with contaminating proteins and minimise misfolding of the polypeptide. Thus, the 
removal of contaminating £ coli proteins by specifically isolating the inclusion bodies 
and the subsequent additional purification prior to folding are important aspects of the 
procedure. 

The folding process is carried out in such a way as to minimise aggregation of 
intermediate-folded states of the polypeptide. Thus, careful consideration needs to be 
given to, among others, the salt type and concentration, temperature, protein 
concentration, redox buffer concentrations and duration of folding. The exact condition 
for any given polypeptide generally cannot be predicted and must be determined by 
experiment. 

There are numerous methods available for the folding of proteins from inclusion 
bodies and these are known to the skilled worker in this field. The methods generally 
involve breaking all the disulphide bonds in the inclusion body, for example with 50mM 
2-mercaptoethanol, in the presence of a high concentration of denaturant such as 8M urea 
or 6M guanidine hydrochloride. The next step is to remove these agents to allow folding 
of the proteins to occur. Formation of the disulphide bridges requires an oxidising 
environment and this may be provided in a number of ways, for example by air, or by 
incorporating a suitable redox system, for example a mixture of reduced and oxidised 
glutathione. 

Preferably, the inclusion body is solubilised using 8M urea, in the presence of 
mercaptoethanol, and protein is folded,, after initial removal of contaminating proteins, by 
addition of cold buffer. A preferred buffer is 20mM ethanolamine containing ImM 
reduced glutathione and O.SmM oxidised glutathione. The folding is preferably carried 
out at a temperature in the range 1 to 5°C over a period of 1 to 4 days. 

If any precipitation or aggregation is observed, the aggregated protein can be 
removed in a number of ways, for example by centrifugation or by treatment with 
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precipitant such as ammonium sulphate. Where either of these procedures are adopted 
monomeric polypeptide is the major soluble product. 

If the bacterial cell secretes the protein, folding is not usually necessary. 
Peptide linkages in the derivatives of the invention may be made chemically or 
biosynthetically by expression of appropriate coding DNA sequences, Non peptide 
linkages may be made chemically or enzymatically by post-translational modification. 

The polypeptide portion of the derivatives of the invention may be prepared by 
expression in suitable hosts of modified genes encoding the soluble polypeptide of the 
invention plus one or more peptidic membrane binding elements and optional residues 
such as cysteine to introduce linking groups to facilitate post translational derivatisation 
with additional membrane binding elements. 

The polypeptide portion of the derivative of the invention may include a C- 
terminal cysteine to facilitate post translational modification. Expression in a bacterial 
system is preferred for proteins of moderate size (up to ~70kDa) and with <~8 disuiphide 
bridges. More complex proteins for which a free terminal Cys could cause refolding or 
stability problems may require stable expression in mammalian cell lines (especially 
OHO). This will also be needed if a carbohydrate membrane binding element is to be 
introduced post-translationaiiy. The use of insect ceils infected with recombinant 
baculovirus encoding the polypeptide portion is also a useful general method for 
preparing more complex proteins and will be preferred when it is desired to carry out 
certain post-transiational processes (such as palmitoylation) biosynthetically (see for 
example, M. J.Page er al J.Biol.Chem. 264. 19147-19154, 1989). 

A preferred method of handling proteins C -terminally derivatised with cysteine is 
as a mixed disuiphide with mercaptoethanoi or glutathione or as the 2-nitro, 5- 
caxboxyphenyi thio- derivative as generally described below in Methods. 

Peptide membrane binding elements may be prepared using standard solid state 
synthesis such as the Merrifieid method and this method can be adapted to incorporate 
required non-peptide membrane binding elements such as N-acyl groups derived from 
myristic or palmitic acids at the N terminus of the peptide. In addition activation of an 
amino acid residue for subsequent linkage to a protein can be achieved during chemical 
synthesis of such membrane binding elements. Examples of such activations include 
formation of the mixed 2-pyridyi disuiphide with a cysteine thiol or incorporation of an 
N-haloacetyl group. Peptides can optionally be prepared as the C-terminal amide. 

The derivatives of the invention may utilise a peptide membrane binding element 
comprising one or more derivatisations selected from: 

a terminal cysteine residue optionally activated at the thiol group; 
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an N-haloacetyl group (where halo signifies chlorine, bromine or iodine) located 
at the N-terrninus of the the peptide or at an s-amino group of a lysine residue; 
an amide group at the C -terminus; and 

a fatty acid N-acyl group at the N-terminus or at an s-amino group of a lysine 
residue. 

Chemical bridging groups include those described in EPO 109653 and EPO 152736. 
The bridging group is generally of the formula; 

-A-R-B- (V) 

in which each of A and B, which may be the same or different, represents -CO-, 
_C(=NH? + )-, maieimido, -S- or a bond and R is a bond or a linking group containing one 
or more -(CH?> or meta- or para- disubstituted phenyl units. 

Where the polypeptide portion of the derivative of the invention and a peptidic 
membrane binding element both include a C-terminal cysteine the chemical bridging 
group will take the form -S-S-. Trie bridge is generated by conventional disulphide 
exchange chemistry, by activating a thiol on one polypeptide and reacting the activated 
thiol with a free thiol on the other polypeptide. Such activation procedures make use of 
disulphides which form stable thioiate anions upon cleavage of the S-S linkage and 
include reagents such as 2,2' dithiopyridine and 5 7 5 f -dithio(2-nitrobenzoic acid, DTNB) 
which form intermediate mixed disulphides capable of further reaction with thiols to give 
stable disulphide linkages. 

R may include moieties which interact with water to maintain the water solubility 
of the linkage and suitable moieties include -CO-NH-, -CO-NMe-, -S-S-, -CH(OH)-, 
_SO?-, -CO?-, -(CH2CH?-0) m - and -CH(COOH)- where m is an integer of 2 or more. 

Examples of R include -(CH^r, -(CH 2 )p-S-S-(CH 2 ) q - and 
.(CH2) p .CH(OH)-CH(OH)-(CH2)q- 7 in which r is an infeger of at least 2, preferably at 
least 4 and p and q are independently integers of at least 2. 

The bridging group of formula (V) may be derived from a linking agent of 
formula (VI): 

X-Ri-Y (VI) 

in which Rj is a linking group containing one or more -(CH2)- units and X and Y 
are functional groups reactable with surface amino acid groups, preferably a lysine or 
cysteine group, or the N-terminal amino group, or a protein attachment group. 

Preferred agents are those where X and Y are different, known as 
heterobifunctional agents. Each end of the agent molecule is reacted in turn with each 
polypeptide to be linked in separate reactions. Examples of heterobifunctional agents of 
formula (VI) include: 

N-succinimidyl 3-(2-pyridyldithio) propionate 
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succinimidyl 4-(N-maieimido) caproate 
3-(2-pyridyl) methyl propionimidate hydrochloride 

In each case Y is capable of reacting with a thiol group on a polypeptide, which 
may be a native thiol or one introduced as a protein attachment group. 

The protein attachment group is a functionality derived by modification of a 
polypeptide with a reagent specific for one or more amino acid side chains, and which 
contains a group capable of reacting with a cieavable section on the other polypeptide. 
An example of a protein attachment group is a thiol group. An example of a cieavable 
section is a disulphide bond. Alternatively the cieavable section may comprise an a, (3 
dihydroxy function. 

As an example, the introduction of a free thiol function by reaction of a 
polypeptide with 2-iminothiolane T N-succinimidyl 3-(2~pyridyldithio) propionate (with 
subsequent reduction) or N-acetyl homocysteine thioiactone will permit counline of the 
protein attachment group with a thioi-reactive B structure. Alternatively, the protein 
attachment group can contain a thiol-reactive entity such as the 6-maieimidohexyl group 
or a 2-pyridy[-dithio group which can react with a free thiol in X. Preferably, the protein 
attachment group is derived from protein modifying agents such as 2-iminothiolane that 
react with lysine s-amino groups in proteins. 

When X represents a group capable of reacting directly with the amino acid side 
chain of a protein, it is preferably an N-succinimidyl group.. When X represents a group 
capable of reacting with a protein attachment group, it is preferably a pyridylthio group. 

In the above processes, modification of a polypeptide to introduce a protein 
attachment group is preferably carried out in aqueous buffered media at apH between 3.0 
and 9.0 depending on the reagent used. For a preferred reagent, 2-iminothiolane, the pH 
is preferably 6.5-8.5. The concentration of polypeptide is preferably high (> lOmg/ml) 
and the modifying reagent is used in a moderate (1.1- to 5-fold) molar excess, depending 
on the reactivity of the reagent. The temperature and duration of reaction are preferably 
in the range 0°-40°C and 10 minutes to 7 days. The extent of modification of the 
polypeptide may be determined by assaying for attachment groups introduced. 

Such assays may be standard protein chemical techniques such as titration with 
5,5'-dithiobis-(2-nitrobenzoic acid). Preferably, 0.5-3.0 moles of protein attachment 
group will be introduced on average per mole of polypeptide. The modified polypeptide 
may be separated from excess modifying agents by standard techniques such as dialysis, 
ultrafiltration, gel filtration and solvent or salt precipitation. The intermediate material 
may be stored in frozen solution or lyophilised. 
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Where a protein attachment group is introduced in this way, the bridging sroup 
(V) will be formed from a reaction of the linking agent (VI) and the protein attachment 
group. 

The polypeptides to be linked are reacted separately with the linking aaent or the 
reagent for introducing a protein attachment group by typically adding an excess of the 
reagent to the polypeptide, usually in a neutral or moderately alkaline buffer, and after 
reaction removing low molecular weight materials by gel filtration or dialysis. The 
precise conditions of pH, temperature, buffer and reaction time will depend on the nature 
of the reagent used and the polypeptide to be modified. The polypeptide linkage reaction 
is preferably carried out by mixing the modified polypeptides in neutral buffer in an 
equimolar ratio. Other reaction conditions e.g. time and temperature, should be chosen to 
obtain the desired degree of linkage. If thiol exchange reactions are involved, the 
reaction should preferably be carried out under an atmosphere of nitrogen. Preferably, 
UV-active products are produced (eg from the release of pyridine 2-thione from 2-pyridyl 
dithio derivatives) so that coupling can be monitored. 

After the linkage reaction, the polypeptide conjugate can be isolated by a number 
of chromatographic procedures such as gel filtration, ion-exchange chromatography, 
affinity chromatography or hydrophobic interaction chromatography. These procedures 
my be either low pressure or high performance variants. 

The conjugate may be characterised by a number of techniques including low 
pressure or high performance gel filtration, SDS polyacry tamide gel electrophoresis or 
isoelectric focussing. 

Membrane binding elements which are fatty acid derivatives are attached post 
transiationally to a peptidic membrane binding element, preferably at the terminus of the 
poiypetide chain. Preferably, where the recombinant polypeptide portion of the 
derivative of the invention contains the peptidic membrane binding element, it has a 
unique cysteine for coupling to the fatty acid derivative. Where the recombinant 
polypeptide has a cysteine residue, a thiol-derivative of the fatty acid is added to the 
refolded recombinant protein at a late stage in purification (but not necessarily the final 
stage) and at a reagent concentration preferably below the critical micelle concentration. 
One of the fatty acid derivative and the recombinant peptide will have the thiol group 
activated as described above for thiol interchange reactions. The fatty acid derivative is 
preferably a fatty acyi derivative of an aminoC2-6alkane thiol (optionally C-substituted) 
such as N-(2-myristoyl ) aminoethanethiol orN-myristoyl L-cysteine. 

Suitable examples of hydrophilic synthetic polymers include poiyethyleneglycol 
(PEG), preferably a,co functionalised derivatives, more preferably a-amino , co-carboxy- 
PEG of molecular weight between 400 and 5000 dalcons which are linked to the 
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polypepcide for example by solid-phase synthesis methods (amino group derivatisation) 
or by thiol-interchange chemistry. 

Membrane binding elements derived from ligands of known integral membrane 
proteins, either amino acid sequences or carbohydrates, may be generated by post- 
translational modification using the giycosylation pathways of eukaryotic cells targeted to 
N-Iinked giycosylation sites in the peptide sequence. 

Convenient generic final stage purification strategies are hydrophobic interaction 
chromatography (HIC ) on C2-C8 media and cation exchange chromatography for 
separation of derivatised and underivatised proteins into which a hydrophobic- 
electrostatic switch combination has been inserted. 

In a further aspect, therefore, the invention provides a process for preparing a 
derivative according to the invention which process comprises expressing DNA encoding 
the polypeptide portion of said derivative in a recombinant host cell and recovering the 
product and thereafter post transiationaily modifying the polypeptide to chemically 
introduce membrane binding elements. 

The invention also extends to DNA encoding the polypeptide portion of the 
derivative and to replicable expression vectors and recombinant host cells containing the 
DNA. 

The polypeptide or derivative of this invention is useful in the treatment or 
diagnosis of many complement-mediated or complement-related diseases and disorders 
including, but not limited to, those listed below. 

Disease and Disorders Involving Complement 
Neurological Disorders 
multiple sclerosis 
stroke 

Guiilain Bane Syndrome 
traumatic brain injury 
Parkinson's disease 
allergic encephalitis 
Alzheimer's disease 

Disorders of Inappropriate or Undesirable Complement Activation 
haemodialysis complications 
hyperacute allograft rejection 
xenograft rejection 
corneal graft rejection 
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interfeukin-2 induced toxicity during IL-2 therapy 
paroxysmal nocturnal haemogiobinuria 



Inflammatory Disorders 

inflammation of autoimmune diseases 
Crohn's Disease 

adult respiratory distress syndrome 

thermal injury including burns or frostbite 

uveitis 

psoriasis 

asthma 

acute pancreatitis 
Kawasaki's disease 



Post-Ischemic Reperftision Conditions 
myocardial infarction 
balloon angiooiasrv 

atherosclerosis (cholesterol-induced) & restenosis 
hypertension 

post-pump syndrome in cardiopulmonary bypass or renal haemodialysis 
renal ischemia 
intestinal ischaemia 



Infectious Diseases or Sepsis 
multiple organ failure 
septic shock 



Immune Complex Disorders and Autoimmune Diseases 
rheumatoid arthritis 
systemic lupus erythematosus (SLE) 
* SLE nephritis 
proliferative nephritis 
glomerulonephritis 
haemo lytic anemia 
myasthenia gravis 



Reproductive Disorders 
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antibody- or complement-mediated infertility 
Wound Healing 

The present invention is also directed to a pharmaceutical composition comprising 
a therapeutically effective amount of a polypeptide or derivative, as above, and a 
pharmaceutically acceptable carrier or excipient. 

The Dresent invention also provides a method of treating a disease or disorder 
associated with inflammation or inappropriate complement activation comprising 
administering to a subject in need of such treatment a therapeutically effective amount of 
a polypeptide or derivative of this invention. 

In the above methods, the subject is preferably a human. 

An effective amount of the polypeptide or derivative for the treatment of a disease 
or disorder is in the dose range of 0.01-100mg/kg; preferably O.lmg-lOmg/kg. 

For administration, the polypeptide or derivative should be formulated into an 
appropriate pharmaceutical or therapeutic composition. Such a composition typically 
contains a therapeutically active amount of the polypeptide or derivative and a 
pharmaceutically acceptable excipient or carrier such as saline, buffered saline, dextrose, 
or water. Compositions may also comprise specific stabilising agents such as sugars, 
including mannose and mannitot, and local anaesthetics for injectable compositions, 
including, for example, lidocaine. 

Further provided is the use of a polypeptide or derivative of this invention in the 
manufacture of a medicament for the treatment of a disease or disorder associated with 
inflammation or inappropriate complement activation. 

In order to inhibit complement activation and, at the same time, provide 
thrombolytic therapy, the present invention provides compositions which further 
comprise a therapeutically active amount of a thrombolytic agent. An effective amount 
of a thrombolytic agent is in the dose range of 0.01-10mg/kg; preferably 0.1-5mg/kg. 
Preferred thrombolytic agents include, but are not limited to, streptokinase, human tissue 
type plasminogen activator and urokinase molecules and derivatives, fragments or 
conjugates thereof. The thrombolytic agents may comprise one or more chains that may 
be fused or reversibly linked to other agents to form hybrid molecules (EP-A-0297882 
and EP 155387), such as, for example, urokinase linked to plasmin (EP-A-0152736), a 
fibrinolytic enzyme linked to a water-soluble polymer (EP-A-0183.503). The 
thrombolytic agents may also comprise muteins of plasminogen activators (EP-A- 
0207589). In a preferred embodiment, the thrombolytic agent may comprise a reversibly 
blocked in vitro fibrinolytic enzyme as described in U.S. Patent No. 4,285.932. A most 
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preferred enzyme is a p-anisoyl plasminogen-streptokinase activator complex as 
described in U.S. Patent No. 4,808,405, and marketed under the Trademark EMINASE 
(generic name anistreplase, also referred to as APSAC; Monk et aL, 1987, Drugs 34:25- 
49). 

Routes of administration for the individual or combined therapeutic compositions 
of the present invention include standard routes, such as, for example, intravenous 
infusion or bolus injection. Active complement inhibitors and thrombolytic agents may 
be administered together or sequentially, in any order. 

The present invention also provides a method for treating a thrombotic condition, 
in particular acute myocardial infarction, in a human or non-human animal. This method 
comprises administering to a human or animal in need of this treatment an effective 
amount of a polypeptide or derivative according to this invention and an effective amount 
of a thrombolytic agent 

Also provided is the use of a polypeptide or derivative of this invention and a 
thrombolytic agent in the manufacture of a medicament for the treatment of a thrombotic 
condition in a human or animal. Such methods and uses may be carried out as described 
'in WO 91/05047. 

This invention further provides a method for treating adult respiratory distress 
syndrome (ARDS) in a human or non-human animal. This method comprises 
administering to the patient an effective amount of a polypeptide or derivative according 
to this invention. 

The invention also provides a method of delaying hyperacute allograft or 
hyperacute xenograft rejection in a human or non-human animal which receives a 
transplant by administering an effective amount of a polypeptide or derivative according 
to this invention. Such administration may be to the patient or by application to the 
transplant prior to implantation. 

The invention yet further provides a method of treating wounds in a human or 
non-human animal by administering by either topical or parenteral e.g. intravenous 
routes, an effective amount of a polypeptide or derivative according to this invention. 

GENERAL METHODS USED IN EXAMPLES 

(i) DNA Cleavage 

Cleavage of DNA by restriction endonucleases was carried out according to the 
manufacturer's instructions using supplied buffers. Double digests were carried out 
simultaneously if the buffer conditions were suitable for both enzymes. Otherwise 
double digests were carried out sequentially where the enzyme requiring the lowest salt 
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condition was added first to the digest. Once the digest was complete the salt 
concentration was altered and the second enzyme added. 

(ii) DNA ligation 

Ligations were carried out using T4 DNA ligase purchased from Promega, as 
described in Sambrook et at, (1989) Molecular Cloning: A Laboratory Manual 2nd 
Edition. Cold Spring Harbour Laboratory Press. 

(iii) Plasmid isolation 

Plasmid isolation was using Promega Wizard™ Plus Minipreps or Qiagen 
Plasmid Maxi kit according to the manufacturers instructions. 

(iv) DNA fragment isolation 

DNA fragments were excised from agarose geis and DNA extracted using the 
QIAEX gei extraction kit or Qiaquick, or GeneClean gel extraction kits according to the 
manufacturer's instructions (QIAGEN Inc., USA, Bio 101 Inc, USA). 

(v) Introduction of DNA into E. coli 

Plasmids were transformed into E. coli BL21(DE3) (Studier and Moffat, (1986), 
J. MoL Biol 189:1 13) that had been made competent using calcium chloride as described 
in Sambrook et at, (1989). E coli JM1Q9 and XLl-blue strains were purchased as a 
frozen competent culture from Promega. 

(vi) DNA sequencing 

Plasmid DNA is sequenced on a Vistra DNA Labstation 625. The sequencing 
chemistry is performed using Amersham International's Thermo Sequenase fluorescent 
dye-terminator cycle sequencing kit' (RPN 2435), in conjunction with their 'FMP 
fluorescent dye-terminator precipitation kit' (RPN 2433) according to the manufacturer's 
instructions. 

The sequences produced by the above procedure are analysed by a Perkin Elmer 
ABI Prism 377 DNA Sequencer. This is an eiectrophoretic technique using 36 cm x 
0.2mm 4% acrylamide gels, the fluorescentiy labeled DNA fragments being detected by a 
charge coupled device camera according to the manufacturer's instructions. 

(vii) Production of oligonucleotides 
Oligonucleotides were purchased from Cruachem. 
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(viii) pBROC413 

The plasmid pT7-7 [Tabor, S (1990), Current Protocols in Molecular Biology, F. 
A. AusubeL Brent R. E. Kingston, D. D. Moore, J. G. Seidman, J. A. Smith, and K. 
Struhheds.] pp.16-2.l-16-2.il, Greene Publishing and Wiley-Interscience,New York.] 
contains DNA corresponding to nucleotides 2065-4362 of pBR322 and like pBR322 can 
be mobilized by a conjugative plasmid in the presence of a third plasmid ColK. A 
mobility protein encoded by ColK acts on the nic site at nucleotide 2254 of pBR322 
initiating mobilization from this point. pT7-7 was digested with Lspl and BglH and the 
protruding 5* ends filled in with the Klenow fragment of DNA Polymerase! The 
plasmid DNA fragment was purified by agarose gei electrophoresis, the blunt ends 
ligated together and transformed into £ coli DH1 by electroporation using a Bio-Rad 
Gene Puiser and following the manufacturers recommended conditions. The resultant 
plasmid pBROC41 3 was identified by restriction enzyme analysis of plasmid DNA, 

The deletion in pBROC413 from the Lspl site immediately upstream of the f 10 
promoter to the BgfLl site at nucleotide 434 of pT7-7 deletes the DNA corresponding to 
nucleotides 2065-2297 of pBR322. The nic site and adjacent sequences are therefore 
deleted making pBROC413 non mobilizable. 

(Lx) Sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS PAGE) 

SDS PAGE was carried out generally using the Novex system (British 
Biotechnology) according to the manufacturer's instructions. Prepacked gels of 
acryiamide concentrations 4 - 20% were most frequendy used. Samples for 
electrophoresis, including protein molecular weight standards (for example LMW Kit, 
Pharmacia or Novex Mark 12) were usually diluted in l%(w/v)SDS - containing buffer 
(with or without 5%(v/v) 2-mercaptoethanoi), and left at room temperature for about 0.5 
h before application to the gel. 

(x) Reduction of disulphides and modification of thiols in proteins 

There are a number of methods used for achieving the title goals. The reasons it 
may be necessary to carry out selective reduction of disulphides is that during refolding, 
concentration and further purification of multi-thioi proteins inappropriate disulphide 
pairing can occur. In addition, even if correct disulphide paring does occur, it is possible 
that a free cysteine in the protein may become blocked with the reducing agent, for 
example glutathione. These derivatives are generally quite stable. In order to make 
them more reactive, for example for subsequent conjugation to another functional group, 
they need to be selectively reduced, with for example dithiothreitol (DTT) or with Tris 
(2-carboxyethyl) phosphine.HCl (TCEP) then optionally modified with a function which 
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is moderately unstable. An example of the latter is Ellmans reagent (DTNB) which gives 
a mixed disulphide. In the case where treatment with DTNB is omitted, careful attention 
to experimental design is necessary to ensure that dimerisation of the free thiol-conrainincr 
protein is minimised. Reference to the term 'selectively reduced' above means that 
reaction conditions eg. duration, temperature, molar ratios of reactants have to be 
carefully controlled so that disulphide bridges within the natural architecture of the 
protein are not reduced. All the reagents are commercially available eg. from Sisma or 
Pierce. 

The following general examples illustrate the type of conditions that may be used 
and that are useful for the generation of free thiols and their optional modification. The 
specific reaction conditions to achieve optimal thiol reduction and/or modification are 
ideally determined for each protein batch. 

TCEP may be prepared as a 20mM solution in 50mM Hepes (approx. pH 4.5) and 
may be stored at -40 degrees C. DTT may be prepared at 1 OmM in sodium phosphate 
pH 7.0 and may be stored at -40 degrees C. DTNB may be prepared at lOmM in sodium 
phosphate pH 7.0 and may be stored at -40 degrees C. All of the above reagents are 
'typically used at molar equivalence or molar excess, the precise concentrations ideally 
identified experimentally. The duration and the temperature of the reaction are similarly 
determined experimentally. Generally the duration would be in the range 1 to 24 hours 
and the temperature would be in the range 2 to 30 degrees C. Excess reagent may be 
conveniently removed by buffer exchange, for example using Sephadex G25. A suitable 
buffer is 0.1 M sodium phosphate pH7.0. 

EXAMPLES 

Example 1 Expression and isolation of CM7 (SEQ ID 1) 

(a) Construction of plasmid pBrocSCRl-3CM7 encoding CM7 

CM7 consists of the -short consensus repeats 1 and 2 from the CRl gene fused to 
the sequence of SCR3 from the CRi-Iike gene. The sequence of the DNA encoding CM7 
is shown in seq ID No2. It was constructed using the plasmid coding for SCR1-3 (MQ I - 
> K196) of CR-I, pDB1013-5 (Ref: patent application WO 94/00571). PDB1013-5 was 
subjected to site directed mutagenesis using a QuickChange kit suppled by Stratagene. 
Three pairs of oligonucleotides were utilised to introduce ten amino acid changes changes 
to the native SCR3 sequence corresponding to the changes observed in the CR-1 like 
pseudogene sequence described'by Hourcade et al 1990 (Journal of Biological Chemistry 
265, pp 974-980): Each pair of oligonucleotides is complementary in sequence. Changes 
are shown in lower case. 
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The first pair: 

CGACCAT CgCCAACGGTGATTTC AcCTCTAtCAgTCGCGAGtATTTTCAC 
(SEQ ID No. 3) and 

GTGAAAATaCTCGCGAcTGaTAGAGgTGAAATCACCGTTGGcGATGGTCG 
(SEQ ID No. 4 ) 

resulted in five amino acid changes and loss of an Apol restriction site. 
The second pair: 

GACCTACCaCTGCAATCtGGGTAGCcGTGGTaaaAAGGTGTTTGAGC 
(SEQ ID No. 5) and 

GCTCAAACACCTTtttACCACgGCTACCCaGATTGCAGtGGTAGGTC 
(SEQ ID No. 6) 

resulted in four amino acid changes and the acquisition of a BsaJT restriction site. 
The third pair: 

GC ACTAGc AAaGAC G ATC AAGTGGG (SEQ ID No. 7) and 

CCCACTTGATCGTCt TTgCTAGTGC (SEQ ID No.8) 

resulted in a single amino acid change and the loss of a Spel restriction site. 

To generate DNA encoding CM7 ail six oligonucleotides were used 
simultaneously in the mutagenesis reaction and transformed into competent XL 1 -Blue E. 
coli (Stratagene). Resulting colonies were grown up in LB roth and plasmids extracted 
using standard methodology. The plasmids were screened for sucessfui mutagenesis by 
the loss of the Apol or Spel restriction sites or the acquisition of a new BsaJI restriction 
site. In the first experiment only the oligonucleotide pair (SEQ ID Nos. 7 and 8) were 
incorporated resulting in the loss of the Spel restriction site. This plasmid (pBrocSCRl- 
3P3) was subjected to further rounds of site-directed mutagenesis using the 
oligonucleotides SEQ ID Nos 3,4, 5 and 6 . From this was produced the mutated 
plasmid pBrocSCl-3P7 containing all ten amino acid coding changes in the SCR3 coding 
domain. 

Using the restriction enzymes EcoRI and Hindlll, the mutated SCR3 domain was 
excised from the plasmid pBrocSCRl-3P7 as a 229 base pair fragment and iigated back 
into a 2540 base pair EcoRI/Hindlll fragment of pDB 103 1-5 containing vector and 
SCR1-2 sequences to minimise the possibility of unwanted mutations elsewhere in the 
plasmid. The resulting plasmid was pBrocSCRl-3CM7 encoding the protein CM7 (SEQ 
ID NO: 1). 

(b) Expression of CM7 from pBrocSCRl-3CM7 

pBrocSCRl-3CM7 was transformed into calcium chloride competent E. coli 
8L21(DE3) and resultant colonies were isolated and checked for plasmid content. To 
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express protein from pBrocSCRI-3CM7 in£ coli BL21(DE3) ? a single colony was 
inoculated into 10 ml LB-phosphate media (20g/L tryptone, I5g/L yeast extract, 0.8g/L 
NaCL 0.2g/L Na 2 HP04, 0. ig/L KH2PO4) containing 50ug/ml ampicillin. The culture 
was grown for 6 hours at 37°C, 230 r.p.m. before being used to inoculate 100 ml of the 
same media containing 50 ug/ml ampicillin. Growth was under the same conditions 
overnight. 25 mi of each culture were then used to inoculate 600 ml of the same media 
with 50 ugfinl ampicillin in 3 L Erienmeyer flasks. Cells were grown to an OD of about 
0.25 at A^QO nm - (isopropyl B-D galactopyranoside) was added to a final 

concentration of 1 mM and ceils allowed to continue growth for a further about 8 hours 
before harvesting by centrifugation at 8000 g/10 min. Pellet from 2L of culture was 
stored at -4G°C 

(c) Isolation, refolding, purification and formulation of CM7 

The methods described are essentially those detailed in Dodd I. et al (1995) 
Protein Expression and Purification 6 727-736 with some modifications. 

(i) Isolation of solubilised inclusion bodies 

The frozen cell peliet of £ coli BL21(DE3) (pBrocSCRl-3CM7) was allowed to 
thaw at room temperature for 2h and resuspended in 50 mM Tris/50 mM NaCI/1 mM 
EDTA pH 8.0 (approx. 6GmI). The suspension was transferred to a glass beaker 
surrounded by ice and sonicated (Heat systems - Ultrasonics W380; 50 x 50% pulse, 
pulse time = 5 sec.) for 3 min. and then spun at 7000 rpm for 20 min. The supernatant 
was decanted and discarded The pellet was resuspended in 20 mM Tris/8M urea/1 mM 
EDTA/50 mM 2-mercaptoethanol pH 8.5 (80ml) at room temperature by vigorous 
swirling, then left for lh at room temperature with occasional swiriing. 

(ii) Initial purification using SP-Sepharose 

To the viscous solution was added SP-Sepharose FF (30g wet weight) that had 
been water washed and suction-dried. The mixture was swirled vigorously and left static 
for l~2h at room temperature. The supernatant was decanted, sampled and discarded. 
The remaining slurry was resuspended to a uniform suspension and poured into a glass 
jacket and allowed to settle into a packed bed. The column was equilibrated with 0.02M 
Tris/8M urea/0.05M 2-mercaptoethanol/0.Q0I M EDTA pH 8.5.at room temperature. 
When the A28O of the eluate had stabilised at baseline, the buffer was changed to 
equilibration buffer additionally containing 1M NaCl. A single A230 peak was eiuted by 
the 1M NaCi-containing buffer; the volume was approx. 40mi. The protein 
concentration of a sample of this solution that had been buffer-exchanged (Sephadex 
G25) into 50 mM formic acid was estimated by A?gG determination, using a molar 
extinction coefficient of 25000 cm* 1 and the formula A = ECL where A is the absorption 
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of the solution at 280nm, E is the moiar extinction coefficient. C is the molar 
concentration of protein and L is the light path in cm. This showed the product had a 
protein concentration of 0.44mg/ml. SDS PAGE showed that the product contained 
apparently two major species, both win a molecular weight around 22 000. The solution 
was stored at -40°C 

(iii) Folding and further processing 

The 40mi SP-Sepharose-purified product was added gradually over a 1 min period 
to 1240 mi freshly prepared, cold 0.06 M ethanolamine/lmM EDTA with continuous 
swirling, and left static for I h/4°C. Reduced glutathione (GSH) was added to 1 mM and 
oxidised glutathione (GSSG) was added to 0.5 mM by the addition of 100-fold 
concentrates of both solutions. The solution was clear and was left static approx 2-3 °C 
for 5d. The solution was then uitrafiltered using a YM1 0 membrane to a final retentate 
volume of 59 mi; the retentate was clear and it was mixed with 9 vol. 0.1M 
NaH?P04/lM (NH^oSQ^ pH 7.0 (Buffer A) at room temperature and immediately 
centrifuged at 3000 rpm for 20 min. The supernatant was applied to a column 
(i.d.,26mm, h., 100mm) of Butyl Toyopearl 650M and the column was developed using a 
linear gradient of Buffer A to 0.1M sodium phosphate pH7.0. All the chromotography 
was at room temperature at 2 ml / min. A single A2S0-peak was noted during the 
gradient. The fractions spanning the peak were pooled and were uitrafiltered (YM10) to 
2.5ml. This retentate was regarded as the product. The product contained one major 
species by non-reduced SDS PAGE with an estimated purity of >95% and an apparent 
molecular weight of 21 000. A sample of the product diluted 1 0-fold in 0. 1 M sodium 
phosphate pH7,0 had an A280 of 0.47; using an extinction coefficient of 34000 allowed 
the protein concentration to be calculated as 140uM. C18 Poros HPLC using a 
acetonitriie gradient in 0.08% TFA gave a single A2 15 peak with an estimated purity of 
approx. 99%. Electrospray mass spectrophotometry gave a mass of 21887. 

Example 2 Construction of plasmids pBrocSCRl-3CMl, 2 7 3, 5 and 6 encoding 
CM1, CM2, CM3, CMS and CM6 

These constructs contain some but not all of the CRi-like gene changes in the 
SCR3 region of SCR1-3 of CR1. The plasmids pBrocSCRl-3Pl, 2, 3, 5 and 6 were 
generated by site directed mutagenesis using the method described in Example la) using 
different combinations of oligonucleotide pairs: 

pBrocSCRi-3Pl was made using only SEQ ID Nos 3 and 4 

pBrocSCRl-3P2 was made using only SEQ ID Nos 5 and 6 

pBrocSCRI-3P3 was made using only SEQ ID Nos 7 and 8 

pBrocSCRl-3P5 was made using SEQ ID Nos 5 to 8 and 
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pBrocSCRl-3P6 was made using SEQ ID Nos 3, 4, 7 and 8 
Plasmids pBrocSCRI-3CMi, 2, 3, 5 and 6 were generated by excision of 229 base pair 
fragments encoding the mutated SCR3 domains from plasmids pBrocSCRI-3Pl to 
pBrocSCRl-3P6 and the fragments were iigated back into a 2540 base pair 
EcoRI/Hindlll fragment of pDBl 013-5 containing vector and SCR1-2 sequences to 
minimise the possibility of unwanted mutations elsewhere in the plasmid. The resulting 
plasmids were pBrocSCRI-3CMl, 2, 3, 5 and 6, encoding the proteins CMl, CM2, CM3, 
CM5 and CM6 (SEQ ID NOs 9, 1 1, 13, 15 and 17, respectively). The DNA sequences 
encoding these polypeptides is shown in Sequence ID Nos. 10,12,14,16 and 18. 

CMl, CM2, CM3, CM5 and CM6 were expressed in£. coli using methods 
similar to those described for CM7 in Example 1 . Analysis of total cell lysate by SDS 
PAGE followed by staining for protein showed that the IPTG treatment led to marked 
expression of a polypeptide with a molecular weight of about 22 000, which was 
indistinguishable from CM7 analysed in the same way. The whole ceil pellet may be 
stored at -40 degrees C or below until required. 

The E. coli ceil pellet can be processed in exactly the same way as described for 
CM7 to obtain purified, concentrated protein for further evaluation. 

Example 3 

(i) Construction of plasmids pBroc3CR3CM8, 9, 10, 12, 13 and 14 encoding CM8, 
CM9, CM10, CM12, CM13 and CM14 (SEQ ED NOs: 19, 21, 23, 25, 27 and 29) 
These constructs comprise a single SCR corresponding to SCR3 of CR1 and 
containing varying numbers of changes to CRl-like sequences. CM 14 is therefore SCR3 
from the CRl-like gene. The plasmids pBrocSCR3CM8, 9, 10, 12, 13 and 14 were 
generated from the plasmids pBrocSCRl-3Pl, 2, 3, 5, 6 and 7 and the plasmid pBroc435 
(Ref: patent application WO 94/00571). Digestion of pBrocSCRl-3Pi, 2, 3, 5, 6 and 7 
with the restriction enzymes EcoRI and Hindlll yielded fragments of 229 base pairs 
encoding the mutated SCR3 domains from pBrocSCRi-3Pl, 2, 3, 5, 6 and 7, and these 
were iigated with a EcoRI/Hindlll fragment of 2171 base pairs from pBroc435 containing 
vector and promoter sequences. The resulting plasmids were pBrocSCR3CM8, 9, 10, 12, 
13 and 14, encoding the proteins CMS, CM9, CM10, CM12, CM13 and CM14 (SEQ ID 
NOs: 19, 21, 23, 25, 27 and 29, respectively), i.e., mutant SCR3 domains identical in 
sequence to the SCR3 domains of CMl, CM2, CM3, CM5, CM6 and CM7. The DNA 
sequences encoding CM8, CM9, CM 10, CM 12, CM 13 and CM 14 are given in sequence 
ID Nos 20, 22, 24, 26, 28 and 30. 
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CMS, CM9, CM10. CM12 andCMIS may be expressed in £. coli using methods 
similar to those described for CM7 in Example 1 . The whole ceil pellet may be stored at 
-40 degrees C or below until required. 

The E. coli cell pellet can be processed in a similar way to that described for CM7 
ceil pellet to obtain purified, concentrated protein for further evaluation. More 
preferably, modifications may be made to the isolation protocol, for example those 
described in Clark, N.S, 1996 (Ph. D. Thesis, Southampton University). 

(ii) Expression and isolation of protein CM14 (SEQ ID NO:29) 
Using methods similar to those described for CM7 in Example 1 plasmid 
pBrocSCR3CM14 was used to transform competent Ecoli BL21(DE3) strain and 
CM 14 protein was expressed The whole cell pellet was stored at -40 degrees C until use. 

CM 14 protein was isolated from the E. coli ceil pellet initially in a similar way to 
that described for CM7. In brief, the inclusion bodies were isolated and they were 
solubilised in fully reducing buffer 40g ammonium sulphate was added to 200ml of the 
solubilsed inclusion bodies; after stirring for 2h at RT the 0 - 20% precipitate was 
isolated by centrifugation. This precipitate was solubilised in 8M urea / 50mM 2- 
mercaptoethanoi - containing buffer; the preparation contained a major protein band by 
SDS PAGE under non-reducing conditions with an apparent molecular weight of about 
7000. 

Example 4 Expression and isolation of CM7/cys (SEQ ID NO 31) 
Construction of plasmid pBrocSCRl-3CM7mutcys encoding CM7/cys. 
The plasmid pBrocSCRl-3CM7mutcys was produced by site directed mutagenesis of 
pBrocSCRl-3CM7 using methods similar to those described in example 1. A pair of 
oligonucleotides with complementary sequence were used in which five changes to the 
sequence of pBrocSCRl-3CM7 had been introduced. Two of the changes introduced a 
unique Apal restriction enzyme site without altering the amino sequence and three 
changes introduced a cysteine codon immediately prior to the stop codon. The sequences 
of the oiteonuclotides used for the mutagenesis were as follows: Seq ID No. 32 
CTGGAGCGGgCCcGCACCGCAGTGCATCATCCCGAACAAAtgcTAATAAAAGC, 

Seq ID No. 33 

GCTTTTATTAgcaTrTGTTCGGGATGATGCACTGCGGTGCgGGcCCGCTCCAG. 
Following site directed mutagenesis and transfomation into competent Ecoli, the 
resulting colonies were analysed by restriction enryme digestion for the introduction of 
the new Apal restriction site. DNA sequencing confirmed that the encoded amino acid 
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sequence had been altered fay the addition of a single C-terminal cysteine residue to <*ive 
SeqIDNo3I. 

(ii) Expression and isolation of CM7/cys protein 

CM7/cys was expressed in £ coli transformed with pBrocSCRl- 
3CM7mutcys using methods identical to those described for CM7 in Example 1 . The 
cell pellet from 1L was stored at -40 degrees C until use. 

The £1 coli cell pellet was processed in exactly the same way as described for 
CM7 to obtain purified, concentrated protein for further evaluation. The final formulated 
protein product - the ultrafiltered retentate of the Butyl Toyopearl-eluted fractions - 
contained 12mg protein based on A2S0 determination using an extinction coefficient of 
34000 and had an apparent molecular weight and purity on SDS PAGE gels (non- 
reduced) of 20000 and about 80% respectively. 

Example 5 Preparation of [CM7]-Cys-S-S-[MSWP~1] (PM-9) (SEQ ID No. 34) 

H 2 N-[CM7}-Cys-OH 



N-{myristoyi)-GIy-Ser-Ser-Lys-Ser-Pro^ 

(i) Myristoyl/Electrostatic Switch Peptide Reagent 1 (MSWP-1) 
N-(Myristoyi) -€Iy-Ser-Ser-Lys-Ser-Pro-Ser-Ly5-Lys-Ly5-Lys-Lys-Lys-Pro-Gly-Asp-(S-2- 
Thiopyrtdyi)Cys-NH 2 

The peptide: 

Gly-Ser-Ser-Lys-Ser-Pro-Ser-Lys-Lys-Lys-Lys-Lys-Lys-Pro-Gly-Asp-Cys-NH? 
(SEQ ID NO 35) 

was prepared using solid phase synthesis via the general Fmoc/tBu strategy developed by 
Sheppard and Atherton (E.Atherton and R.CSheppard, Solid Phase Synthesis, IRL Press, 
Oxford, 1989). Kieselguhr-supported poiydimethylacrylamide resin (Macrosorfa 100) 
was used as the solid support and was derivatised with ethylene diamine. 

Coupling reactions were carried out using N-a-Fmoc protected reagents pre- 
activated with N,N'-diisopropyicarbodiimide/ N-hydroxybenzotriazoie (in 4-fold molar 
excess) with bromophenol blue monitoring. Fnioc cleavages used 20% ptperidine in 
DMF. Reactions to assemble the peptide chain were carried out by repeated cycles of 
coupling and deprotection including the attachment of the modified Rink linkage reagent 
(p-[(R.S)-a'[i-(9H-fluoreny-9-yl-methoxyformamido ] 2.4 dimethoxybenzyl]- 
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phenoxyacetic acid) designed to yield a C -terminal amide on final cleavage. The side 
chain functionalities of the individual amino-acids were protected as follows: 
Ser (tButyl), Lys (Boc), Asp (O-tButyl), Cys (Trityl). 
On completion of the peptide assembly and with the peptide still attached to the 
resin, the myristoyi group was attached to the amino group of the N terminal glycine fay 
direct coupling of myristic acid by the same activation procedure. This modified peptide 
was then cleaved from the resin and the side-chain protecting groups removed at the same 
time by treatment with trifluoracetic acid containing 2.5% water and 2.5% triisopropyi 
silane. 

The crude product was treated with 2,2' dithiopyridine in 0.0 1M ammonium 
acetate solution at pH 8-9 for approx. 2h, then acidified with acetic acid and purified by 
preparative high performance liquid chromatography (HPLC) in 0. 1% trifluoracetic acid 
(TFA) /water and 0. 1% TFA/acetonitrile as gradient component. After lyophilisation, the 
peptide was a white amorphous powder, soluble to at least lOmg/mi in 
dimethyisuiphoxide. Fast atom bombardment mass spectrometry gave main peaks at m/e 
2107.8, 2129.7 and 2145,8, corresponding to the monoprotonated, monosodiated and 
monopotassiated molecular ions of the peptide. The 2-thiopyridyI content of the peptide 
was measured by dissolving it to around G.03mM to 0.2 mM in 0. 1M Sodium Borate pH 
8.0 and reducing by addition of dithiothreitol to 5mM. The change in optical density at 
343nm was used to calculate the amount of pyridine 2-thione released using an 
extinction coefficient at this wavelength of 8080 cm* 1 M" 1 . This indicated that the 
peptide content was approximately 60% of the dry weight. 

(ii) Synthesis of PM9 

CM7/cys from Example 4 (12uM; 0. 1ml) was mixed with TCEP (Tris-(2- 
carboxyethyl) phosphine ) (5mM; 0.001ml) and incubated at room temperature for 18 h. 
0.01ml of 500mM ethanolamine was added and mixed. MSWP-1 from (i) (2mM in 0. 1M 
sodium phosphate pH 7.0; 0.00 14ml) was added and the solution incubated for a further 4 
h. The prodcuct was analysed by SDS PAGE and showed two primary bands under 
non-reducing conditions, the major one with an apparent molecular weight of 20000 and 
a minor one of about 22000, the latter consistent with the formation of target PM9. 

Example 6 Expression and isolation of protein CM15/cys (SEQ ID NO:36) 

CM15-cys(sEQ id NO:3S) comprises SCRi -3 with all amino acid changes 
corresponding to the CR-1 like pseudogene sequence described by Hourcade et al 1990 
(Journal of Biological Chemistry 265, pp 974-980) where they appear in the region 
homologous to SCRI -3 (13 amino acid changes in addition to the 10 changes in 
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CM7)with an additional C-terminal cysteine residue. Plasrnid pBrocSCRI-3CMl5-cys 
was generated by site-directed mutagenesis of pBrocSCRI-3CM7/cys (Example 4). 
Oligonucleotides 20 to 74 bases in length and spanning the regions of change on both 
strands were synthesised, in which minimal base were introduced to alter the amino acid 
sequence and generate or delete restriction sites for diagnostic purposes, pBrocSCRl- 
3CM15-cys was constructed using these oligonucleotides and methodology similar to 
that used to construct pBrocSCRl-3CM7, as described in Example la). 
Six pairs of oligonucleotides were utilised to introduce thirteen amino acid changes 
changes to the native SCR1 and SCR2 sequence corresponding to the changes observed 
in the CR-1 like pseudogene sequence described by Hourcade et al 1990 (Journal of 
Biological Chemistry 265, pp 974-980); Each pair of oligonucleotides is complementary 
in sequence. Changes are shown in lower case. 
The first pair: 

C AGTG C AAC G tGC C GGAATGG (SEQ ID No. 37) and 
CCATTCCGGAaCGTTGCACTG (SEQ ID No. 38) 

resulted in one amino acid changes and the insertion of aP^p!406I restriction sitQ. 

The second pair: GACTGATGAtTTTGAGTTCC (SEQ ID No. 39) and 
GGAACTCAAAaTCATCAGTC (SEQ ID No. 40) 
resulted in one amino acid change and the loss of an Apol restriction site. 

The third pair: 

GTCTGGACTaGTGCTAAGGACaagTGCaaACGTAAATCTTGTCG (SEQ ID No. 41) 
and 

CGACAAGATTTACGTctGCActtGTCCTTAGCACtAGTCCAGAC (SEQ ID No. 42) 
resulted in three amino acid changes and the insertion of a Spel restriction site. 
The fourth pair: 

CGGCATGGcGCATGTGATCAAAGatATCCAGTTCcGaTCgCAAATTAAATATTCT 
TGTcCTAAgGGTTACCGTC (SEQ ID No 43) and 
GACGGTAACCcTTAGgACAAGAATATTTAATTTC 
GATCACATGCgCCATGCCG (SEQ ID No 44) 

resulted in four amino acid changes and the insertion of EcoKV, Pvul and Bsu36I 
restriction sites. 

The fifth pair: 

CATCTCTGGTaATACTGTCATTTGGGATAATaAAACACCGgTTTGTGACC (SEQ 
ID No. 45) and 

GGTCACAAAcCGGTGTTTtATTATCCCAA.ATGACAGTATtACCAGAGATG (SEQ 
ID No. 46) 
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resulted in three amino acid changes and the insertion of an Agel restriction site 

The sixth pair: 
GACCGAATTatcTGTGGTCTG (SEQ ID No. 47) and 
CAGACCACAgatAATTCGGTC (SEQ ID No. 48) 

resulted in one amino acid change and the loss of EcoRI and Apol restriction sites. 

To generate DNA encoding CM15/cys ail twelve oligonucleotides were used 
simultaneously in the mutagenesis reaction and transformed into competent XL 1 -Blue 
coli (Stratagene). Resulting colonies were grown up in LBroth and plasmids extracted 
using standard methodology. The plasmids were screened for sucessful mutagenesis the 
restriction site changes. In the first experiment four of the oligonucleotide pairs were 
incorporated (SEQ ID Nos 37/33, 39/40, 41/42, 43/44) were incorporated. This piasmid 
(pBrocSCRI -3CM2 1 -cys) was subjected to further rounds of site-directed mutagenesis 
using the oligonucleotides SEQ ID Nos 45/46 and 47/48. From this was produced the 
mutated piasmid pBrocSCl-3CM15-cys containing all thirteen amino acid coding 
-changes in the SCR1 and SCR2 coding domain. The sequence was confirmed by DNA 
sequencing. In addition a piasmid encoding SCRl-3/cys with all the pseudogene changes 
with the exception of the Isoleucinel24 Proline substitution at the hinge region separating 
SCR2 and SCR3 as constructed from pBrocSCRl-3CM2t-cys by site directed 
mutagenesis with oligonucleotides Seq ID Nos 45 and 46. 

CM15/cys protein was expressed from pBrocSCRl-3CM15/cys in£. coli using 
methods similar to those described for CM7 in Example 1. The whole cell peilet was 
stored at -40 degrees C until use. 

CM15/cys protein was isolated from the £. coli cell peilet in exactly the same way 
as described for CM7 to obtain purified, concentrated CM15/cys protein for further 
evaluation. The final preparation contained a major protein band by SDS PAGE under 
non-reducing conditions with an apparent molecular weight of about 20000. 

Example 7 Preparation of [CM15I-Cys-S-S-[MSWP-1] (SEQ ID NO:49) 

H 2 N-[CM151-Cys-OH 

S 

1 

S 

I 

N-(myristoyl)-Gly-Ser-Ser-Lys-Ser-Pro-Ser~^ 
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The title compound is prepared from CMlS/cys by procedures described in Example 5. 

Example 8 Expression and isolation of protein CM16/cys (SEQ ID NO:50) 

CM16/cys is CMlS/cys with the modification II24P(i.e wild-type sequence at the 
hinge region between SCR2 and SCR3) The plasmidpBrocSCRl-3CM16cys was 
generated as described in example 6. 

CM16/cys protein was expressed from pBrocSCRl-3CM16/cys in £ coli using 
methods similar to those described for CM7 in Example 1. The whole ceil pellet was 
stored at -40 degrees C until use. 

CMI6/cys protein was isolated from the £ coli cell pellet in a similar way to that 
described for CM15/cys 7 although on one occasion the protein was precipitated by the 
ammonium sulphate treatment. The final preparation contained a major protein band by 
SDS PAGE under non-reducing conditions with an apparent molecular weight of about 
20000. 

Example 9 Preparation of [CM16J-Cys-S-S-[M3WP-1] (SEQ ID NO:51) 

H 2 N-[CM161-Cys-OH 



N-(myri5toyI)-Gly-Ser-Ser-Lys^er-Pro^er-Lys-Lys-Lys-Lys-Lys-Lys-Pro-Gly-Asp-Cys-NH^ 
The title compound is prepared from CMi6/cys by procedures described in Example 5. 

Example 10 Construction of plasmid pBrocSCRl-3CM7rgdcys encoding 
CM7/rgdcys. 

This construct consists of the sequence of CM7 modified at the C-terminus of the protein 
to contain an RGD sequence as a ligand for Glyoprotein Ilb/IIIa of platelets . 
The plasmid pBrocSCRI-3CM7rgdcys was produced by restriction enzyme digestion of 
pBrocSCR!OCM7mutcys with Apal and HindlllvnA purification of the large fragment. 
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Two oligonucleotides were annealed in vitro and ligated into this fragment. The sequence 
of the inserted oligonucleotides was as follows. Seq ID No. 52 

CGCACCGCAGTGCATCATCCCGAACAAAGATGGCCCGAGCGAAATTCTGCGT 
GGCGATTTTAGCAGCTGCTA and Seq ID No. 53: 

AGCTTAGCAGCTGCTAAAATCGCCACGCAGAATTTCGCTCGGGCCATCTTTGT 
TCGGGATGATGCACTGCGGTGCGGGCC. The amino acid sequence of the encoded 
protein is shown in Sequence ID No. 54. The resulting colonies were analysed by 
restriction enzyme digestion and confirmed by DNA sequencing. 

Example 11 Construction of plasmid pBrocSCRl-3CM7TceIl encoding CM7/Tcell. 

This construct consists of the sequence of CM7 fused at the C -terminus to an extension 
designed to target the protein to the T-cell receptor alpha subunit. 
The plasmid pBrocSCRl~3CM7Tceli was produced by restriction enzyme digestion of 
pBrocSCRl-3CM7mutcys with Apalznd Hindlllmd purification of the large fragment. 
Two oligonucleotides were annealed in vitro and ligated into this fragment. The sequence 
of the inserted oligonucleotides was as follows. Seq ID No. 55 

CGCACCGCAGTGCATCATCCCGAACAAAGCGGCGCCCAGCGTGATTGGCTTC 
CGTATTCTGCTGCTGAAAGTGGCGGGCTGATA and Seq ID No. 56: 

AGCTTATCAGCCCGCCACTTTCAGCAGCAGAATACGGAAGCCAATCACGCTG 
GGCGCCGCTTTGTTCGGGATGATGCACTGCGGTGCGGGCC. The amino acid 
sequence of the encoded protein is shown in Sequence ID No 57. The resulting colonies 
were analysed by restriction enzyme digetion and confirmed by DNA sequencing. 
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Biological Activity 

(i) Anti-complement Activity Measured by the Classical Pathway-mediated 
Haemolysis of Sheep Erythrocytes 

Functional activity of complement inhibitors was assessed by measxiring the 
inhibition of complement-mediated lysis of sheep erythrocytes sensitised with rabbit 
antibodies (Diamedix Corporation, Miami, USA). Human serum diluted 1:125 or 1:100 
in 0.1 M Hepes/0,15 M NaCl/0.1% gelatin pH 7 A was used as a source of complement. 
The serum was prepared from a pool of volunteers essentially as described in Dacie & 
Lewis, 1 975. Briefly, blood was warmed to 37°C for 5 minutes, the clot removed and 
the remaining serum clarified by centririigation. The serum fraction was split into small 
aliquots and stored at -196°C. Aliquots were thawed as required and diluted in the 
Hepes buffer immediately before use. 

Inhibition of complement-mediated lysis of sensitised sheep erythrocytes was 
measured using a standard haemolytic assay using a v-bottom microtitre plate format as 
follows:. 

50 jil of a range of concentrations of inhibitor (typically in the region of 0.1 - 100 

nM) diluted in Hepes buffer were mixed with 50 pi of the diluted serum and 100 ul of 

prewarmed sensitised sheep erythrocytes and then incubated for 1 hour at 37°C Samples 

were spun at I600rpm at ambient temperature for 3 minutes before transferring 150 ll! of 

supernatant to flat bottom microtitre plates and determining the absorption at 410 run.. 

Maximum lysis (Amax) was determined by incubating serum with erythrocytes in the 

absence of any inhibitor. Background lysis (Ao) was determined by incubating 

erythrocytes in the absence of any serum or inhibitor To check whether the inhibitor 

itself had any effect on lysis, erythrocytes were incubateH with inhibitor alone; none of 

the compounds had any direct effect on lysis of the red blood cells. Inhibition was 

expressed as a fraction of the total ceil lysis such that IH50 represents the concentration 

of inhibitor required to give 50% inhibition of lysis. 

IH = A- Ao 
Amax - Ao 

where 0 is equivalent to complete inhibition and 1 equals no inhibition. 

The CM7 protein product of Example 1 inhibited complement-mediated lysis of 
sensitised sheep red blood cells with an IH50 of approx. 6 nM, In a separate 
experiment, CM7 and CM7/cys preparations gave IH50 values of about 20-3GnM and 
they were experimentally indistinguishable from each other. 
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(ii) Anti-complement Activity Measured by Alternative Pathway-mediated 
Haemolysis of Guinea Pig Erythrocytes 

Functional activity of complement inhibitors was assessed by measuring the 
inhibition of complement mediated lysis of guinea pig erythrocytes essentially as 
described by Scesney, 5. M. et al (1996) J. Immunol. 26 1729-1735. The assay is 
designed to be specific for the alternative pathway of complement activation. Human 
serum prepared from a pool of volunteers essentially as described in Dacie & Lewis, 
1975 wasused as the source of complement Briefly, blood was warmed to 37°C for 5 
minutes, the clot removed and the remaining serum clarified by centrifugation. The 
serum fraction was split into small aliquots and stored at -196°C Aliquots were thawed 
as required and diluted in 0.1 M Hepes/ 0.15 M NaCI / 0. 1 % gelatin / 8 mM EGTA / 5 
mM MgCl2 pH 7.4 (buffer A) immediately before use. Guinea pig erythrocytes were 
prepared from guinea pig whole blood collected into EDTA-coated tubes as follows. 
The blood was spun at 1600 rpm for 5 min and the erythrocyte pellet washed 3 times with 
0. 1 M Hepes/ 0. 15 M NaCI / 0. 1% gelatin pH 7.4 until the supernatant of the spin was 
essentially colourless. The erythrocytes were finally resnspended to the original volume 
of blood used and were stored at + 4 degrees C. They were used within 2 weeks. 

50 |il of a range of concentrations of inhibitor diluted in buffer A in a v-bottom 
microtitre plate were mixed with, first, 100 pi of serum that had been diluted 1:3 and 
second, 50 p.1 of guinea pig erythrocytes (diluted 1 :49 in buffer A) and incubated for 1 
hour at 37°C. The plate was spun at 1600 rpm for 3 minutes before transferring 150 \xl 
of each supernatant to a flat bottom microtitre plate and determining the absorption at 405 
nm, which reflects the amount of lysis in each test solution. Maximum lysis (Amax) was 
determined by incubating serum with erythrocytes in the absence of any inhibitor. 
Background lysis (Ao) was determined by incubating erythrocytes in the absence of any 
serum- or inhibitor. To check whether the inhibitor itself had any effect on lysis, 
erythrocytes were incubated with inhibitor alone; none of the compounds had any direct 
effect on lysis of the red blood cells. The final dilution of serum used in the assay did 
absorb at 405nm but the level of absorbance (approx 10% of Amax) was considered to 
have a neglible affect on the overall assay results and it was ignored in the calculations. 
Inhibition was expressed as a fraction of the total ceil lysis such that IH50 represents the 
concentration of inhibitor required to give 50% inhibition of Lysis. 



% inhibition = I - [(A - Ao) / (.Amax - Ao)] 
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CM7 final product similar to that described in Example I was assayed in the guinea pig 
haemolvsis assay. In two separate assays the product inhibited haemolysis with IH50 
values of 170nM and 1 80nM respectively. 



(iii) Inhibition of zymosan A-induced activation of the alternative pathway by CM7. 

The alternative pathway of complement was activated with 2ymosan A 7 a complex 
carbohydrate from yeast (Sigma, catalogue number Z-4250). Zymosan A was made 50 
mg/ml in Hepes buffer (0.1M Hepes/Q.iSMNaCl pH 7.4) and vortexed until a fine 
suspension had formed. Human serum was preincubated with different concentrations of 
complement inhibitor diluted in Hepes buffer for 15 mins at 3 7 (SYMBOL 176 \f 
"Symbol"} C using the volumes given beiow. Zymosan A was then vortexed for a few 
seconds each time before addition to the samples after which samples were incubated for 
a further 30 mins at 37{SYMBOL 176 \f "Symbol"} C. The zymosan A was then spun 
down at approximately 1 l,000g for 30 seconds at ambient temperature. 100 ul of 
supernatant were added to an equal volume of precipitating solution provided in the kit 
and assayed as described in the technical bulletin of Amersham with the C3a des Arg 
assay RIA kit purchased from Amersham International pic, U. K., (human complement 
C3a des Arg [ 125 I]assay, code RPA 518). In the C3a RIA Assay, activation of 
complement pathways can be followed by measuring the release of the anaphylatoxin, 
C3a and its breakdown product C3a des Arg. Both products can be measured using a 
competitive radio-immune assay. Each sample was assayed in duplicate and a useful 
dilution was 1/100. 

Volumes of samples added 

serum inhibitor Zymosan A 

Normal Assay 89 pi 20 ^1 21 fil 

The data were computed essentially as described in the Amersham technical 
bulletin with the exception that the standard curve was not used and data were calculated 
only as B/Bo. 

Controls included maximum activation (A) i.e. serum + zymosan A only, background 
activation (B) i.e. serum + buffer only and background activation in the presence ot 
inhibitor (C) i.e. serum + inhibitor only. D is the value of activation of serum in the 
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presence of inhibitor and zymosan A, . These values could then be used to determine the 
% inhibition at each inhibitor concentration, using the following formula I. Note that the 
formula looks unusual because of the nature of the assay, in particular that because the 
assay is a competition assay all factors are inversed eg. maximum activation actually 
gives the lowest counts in the assay. 

Formula I : D- A X 100 

C- A 

The IC50 is defined as the concentration of inhibitor required to reduce maximum 
activation by 50%. Using the data generated experimentally and reproduced in the table 
below, the IC50 for CM 7 of Example 1 was about luM and was indistinguishable from 
that of SCR1-3 prepared as described in WO94/00571 





SCR1-3 


CM7 


21.5 




106 


15.4 


87 




5.4 




59 


3.8 


65 




L.35 




54 


0.96 


43 




0.34 




15 


0.24 


17 




0.084 




7 


0.06 


6 




0.021 




-2 


0.015 
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SEQUENCE LXSTING 

SEQ ID No . 1 CM7 amino acid sequence 



1 


Met: 


Gin 


Cys 


Asn 


Ala Pro 


Glu 


Trp 


Leu 


Pro 


11 


Phe 


Ala 


Arg 


Pro 


Thr Asn 


Leu 


Thr 


Asp 


Glu 


21 


Phe 


Glu 


Phe 


Pro 


He Gly Thr 


Tyr 


Leu 


Asn 


31 


Tyr 


Glu 


Cys 


Arg 


Pro Gly Tyr 


Ser 


Gly Arg 


41 


Pro 


Phe 


Ser 


lie 


lis Cys 


Leu 


Lys 


Asn 


Ser 


51 


Val 


Trp 


Thr 


Gly 


Ala Lys 


Asp 


Arg 


Cys 


Arg 


61 


Arg 


Lys 


Ser 


Cys 


Arg Asn Pro 


Pro 


Asp 


Pro 


71 


val 


Asn 


Gly 


Met 


Val His 


Val 


lie 


Lys 


Gly 


81 


lie 


Gin 


Phe 


Gly Ser Gin 


lie 


Lys 


Tyr 


Ser 


91 


Cys 


Thr 


Lys 


Gly 


Tyr Arg Leu 


lie 


Gly Ser 


101 


Ser 


Ser 


Ala 


Thr 


Cys lie 


lie 


Ser 


Gly Asp 


111 


Thr 


Val 


lie 


Trp 


Asp Asn 


Glu 


Thr 


Pro 


He 


121 


Cys 


Asp 


Arg 


He 


Pro Cys 


Gly 


Leu 


Pro 


Pro 


131 


Thr 


lie 


Ala 


Asn 


Gly Asp 


Phe 


Thr 


Ser 


He 


141 


Ser 


Arg 


Glu 


Tyr 


Phe His 


Tyr Gly Ser 


Val 


151 


Val 


Thr 


Tyr 


His 


Cys Asn 


Leu 


Gly Ser 


Arg 


151 


Gly Lys 


Lys 


Val 


Phe Glu 


Leu 


Val 


Gly 


Glu 


171 


Pro 


Ser 


lie 


Tyr 


Cys Thr 


Ser 


Lys 


Asp 


Asp 


181 


Gin 


Val 


Gly 


lie 


Trp Ser 


Gly 


Pro 


Ala 


Pro 


191 


Gin 


Cys 


lie 


lie 


Pro Asa 


Lys 









SEQ ID No. 2 CM7 DNA 



1 


ATGCAGTGCA 


51 


GACTGATGAA 


1Q1 


GCCCGGGTTA 


151 


GTCTGGACTG 


201 


GCCAGATCCG 


251 


GTTCCCAAAT 


301 


TCCAGCGCTA 


351 


AACACCGATT 


401 


ACGGTGATTT 



sequence 

ACGCTCCGGA ATGGCTGCCG 
TTTGAGTTCC CGATCGGTAC 
TAGCGGCCGC CCGTTTTCTA 
GTGCTAAGGA CCGTTGCCGA 
GTTAACGGCA TGGTGCATGT 
TAAATATTCT TGTACTAAAG 
CATGCATCAT CTCTGGTGAT 
TGTGACCGAA TTCCGTGTGG 
CACCTCTATC AGTCGCGAGT 



TTCGCGCGCC CGACCAACCT 
CTACCTGAAC TACGAATGCC 
TCATCTGCCT GAAAAACTCT 
CGTAAATCTT GTCGTAATCC 
GATCAAAGGC ATCCAGTTCG 
GTTACCGTCT GATTGGTTCC 
ACTGTCATTT GGGATAATGA 
TCTGCCGCCG ACCATCGCCA 
ATTTTCACTA TGGTTCTGTG 
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4-5 X GTGACCTACC ACTGCAATCT GGGTAGCCGT GGTAAAAAGG TGTTTGAGCT 
501 CGTGGGTGAG CCGTCCATCT ACTGCACTAG CAAAGACGAT CAAGTGGGCA 
551 TCTGGAGCGG CCCGGCACCG CAGTGCATCA TCCCGAACAA A 

SEQ IB No. 3 DNA sequence 

1 CGACCATCGC CAACGGTGAT TTCACCTCTA TCAGTCGCGA GTATTTTCAC 

SEQ ID No. 4 DNA sequence 

1 GTGAAAATAC TCGCGACTGA TAGAGGTGAA ATCACCGTTG GCGATGGTCG 

SEQ ID No. 5 DNA sequence 

1 GACCTACCAC TGCAATCTGG GTAGCCGTGG TAAAAAGGTG TTTGAGC 

SEQ ID No . 6 DNA sequence 

I GCTCAAACAC CTTTTTACCA CGGCTACCCA GATTGCAGTG GTAGGTC 

SSQ ID No- 7 DNA sequence 

1 GCACTAGCAA AGACGATCAA GTGGG 

SEQ ID No. 3 DNA sequence 

1 CCCACTTGAT CGTCtTTgCT AGTGC 

SSQ ID No. 9 CM1 amino acid sequence 



1 


Met 


Gin 


Cys 


Asn Ala 


Pro 


Glu 


Trp 


Leu 


Pro 


11 


Phe 


Ala 


Arg 


Pro Thr 


Asn 


Leu 


Thr 


Asp 


Glu 


21 


She 


Glu 


Phe 


Pro He 


Gly 


Thr 


Tyr 


Leu 


Asn 


31 


Tyr 


Giu 


Cys 


Arg Pro 


Gly 


Tyr 


Ser 


Gly 


Arg 


41 


Pro 


Phe 


Ser 


He He 


Cys 


Leu 


Lys 


Asn 


Ser 


51 


Val 


Trp 


Thr Gly Ala 


Lys 


Asp 


Arg 


Cys 


Arg 


51 


Arg 


Lys 


Ser 


Cys Arg 


Asn 


Pro 


Pro 


Asp 


Pro 


71 


Val 


Asn 


Gly 


Met Val 


His 


Val 


He 


Lys 


Gly 


81 


He 


Gin 


Phs 


Gly Ser 


Gin 


He 


Lys 


Tyr 


Ser 


91 


Cys 


Thr 


Lys 


Gly Tyr 


Arg 


Leu 


He 


Gly 


Ser 


1Q1 


Ser 


Ser 


Ala 


Thr Cys 


He 


He 


Ser 


Gly 


Asp 


111 


Thr 


Val 


He 


Trp Asp 


Asn 


Glu 


Thr 


Pro 


He 


121 


Cys 


Asp 


Arg 


Ha Pro 


Cys 


Gly 


Leu 


Pro 


Pro 


131 


Thr 


He 


Ala Asn Gly 


Asp 


Phe 


Thr 


Ser 


He 
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141 Ser Arg Glu Tyr 

151 Val Thr Tyr Arg 

161 Gly Arg Lys Val 

171 Pro Ser lie Tyr 

181 Gin Val Gly lie 

191 Gin Gys lie lie 
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Phe His Tyr Gly Ser Val 
Cys Asn Pro Gly Ser Gly 
Phe Glu Leu Val Gly Glu 
Cys Thr Ser Asn Asp Asp 
Trp Ser Gly Pro Ala Pro 
Pro Asn Lys 



SSQ ID No. 10 CM1 DNA sequence 



x 


ATGCAGTGCA 


ACGCTCCGGA ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


G C\_r'L" *' i *c TAi 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


ATCCAG TTCQ 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCgCCA 


401 


ACGGTGATTT 


CACCTCTATC 


AGTCGCGAGT 


ATTTTCACTA 


TGGTTCTGTG 


4S1 


GTGACCTACC 


GCTGCAATCC 


GGGTAGCGGT 


GGTCGTAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


TAATGACGAT 


CAAGTGGGCA 


551 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAACAA 


A 



SEQ ID No . 11 CM2 amino acid sequence 



1 


Met 


Gin 


Cys 


Asn Ala 


Pro 


Glu 


Trp 


Leu 


Pro 


11 


Phe 


Ala 


Arg 


Pro Thr 


Asn 


Leu 


Thr 


Asp 


Glu 


21 


Phe 


Glu 


Phe 


Pro lie 


Gly 


Thr 


Tyr 


Leu 


Asn 


31 


Tyr 


Glu 


Cys 


Arg Pro 


Gly 


Tyr 


Ser 


Gly 


Arg 


41 


Pro 


Phe 


Ser 


He He 


Cys 


Leu 


Lys 


Asn 


Ser 


51 


Val 


Trp 


Thr 


Gly Ala 


Lys 


Asp 


Arg 


Cys 


Arg 


61 


Arg 


Lys 


Ser 


Cys Arg 


Asn 


Pro 


Pro 


Asp 


Pro 


71 


val 


Asn 


Gly 


Met Val 


His 


Val 


He 


Lys 


Gly 


81 


He 


Gin 


Phe 


Gly Ser 


Gin 


He 


Lys 


Tyr 


Ser 


91 


Cys 


Thr 


Lys 


Gly Tyr 


Arg 


Leu 


lie 


Gly 


Ser 


101 


Ser 


Ser 


Ala 


Thr Cys 


He 


He 


Ser 


Gly 


Asp 


111 


Thr 


Val 


He 


Trp Asp 


Asn 


Glu 


Thr 


Pro 




121 


Cys 


Asp 


Arg 


■ He Pro 


Cys 


Gly 


Leu 


Pro 


Pro 


131 


Thr 


He 


Thr 


Asn Gly 


Asp 


Phe 


He 


Ser 


Thr 


141 


Asn 


Arg 


Glu 


Asn Phe 


His 


Tyr 


Gly 


Ser 


Val 
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151 Val Thr Tyr His 

1S1 Gly Lys Lys Val 

171 Pro Ser He Tyr 

131 Gin Val Gly lie 

151 Gin Cys lie lie 
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Cys Asn Leu Gly Ser Arg 
Phe Glu Leu Val Gly Glu 
Cys Thr Ser Asn Asp Asp 
Trp Ser Gly Pro Ala Pro 
Pro Asn Lys 



ID No 


12 CM2 DNA sequence 


1 


ATGCAGTGCA 


ACGCTCCGGA 


SI 


GACTGATGAA 


TTTGAGTTCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


151 


GTCTGGACTG 


GTGCTAAGGA 


201 


GCCAGATCCG 


GTTAACGGCA 


251 


GTTCCCAAAT 


TAAATATTCT 


301 


TCCAGCGCTA 


CATGCATCAT 


331 


AACACCGATT 


TGTGACCGAA 


401 


ACGGTGATTT 


CATCTCTACC 


451 


GTGACCTACC 


ACTGCAATC7 


501 


CGTGGGTGAG 


CCGTCCATC7 


551 


TCTGGAGCGG 


CCCGGCACCG 



ATGGC7GCCG TTCGCGCGCC CGACCAACCT 
CGATCGGTAC CTACCTGAAC TACGAATGCC 
CCGTTTTCTA TCATCTGCCT GAAAAACTCT 
CCGTTGCCGA CGTAAATCTT GTCGTAATCC 
TGGTGCATGT GATCAAAGGC ATCCAGTTCG 
7GTACTAAAG GTTACCGTCT GATTGGTTCC 
C7C7GG7GAT AC7G7CAT7T GGGA7AATGA 
T7CCGTGTGG TCTGCCGCCG ACCATCACCA 
AA7CG CGAG A ATTTTCACTA TGGTTCTGTG 
GGG7AGCCGT GGTAAAAAGG TG7TTGAGCT 
ACTGCACTAG TAATGACGAT CAAGTGGGCA 
CAGTGCATCA TCCCGAACAA A 



SEQ ZD No. 13 CM3 amino acid sequence 



1 


Met: 


Gin 


11 


Phe 


Ala 


21 


Phe 


Glu 


31 


Tyr 


Glu 


41 


Pro 


Phe 


51 


Val 


Trp 


51 


Arg 


Lys 


71 


Val 


Asn 


ai 


lie 


Gin 


91 


Cys 


Thr 


101 


Ser 


Ser 


111 


Thr 


Val 


121 


Cys 


Asp 


131 


Thr 


lie 


141 


Asn 


Arg 



Cys Asn Ala Pro 
Arg Pro Thr Asn 
Phe Pro lie Gly 
Cys Arg Pro Gly 
Ser He lie Cys 
Thr Gly Ala Lys 
Ser Cys Arg Asn 
Gly Met Val His 
Phe Gly Ser Gin 
Lys Gly Tyr Arg 
Ala Thr Cys lie 
He Trp Asp Asn 
Arg He Pro Cys 
Thr Asn Gly Asp 
Glu Asn Phe His 



Glu 


Trp 


Leu 


Pro 


Leu 


Thr 


Asp 


Glu 


Thr 


Tyr 


Leu 


Asn 


Tyr 


Ser 


Gly 


Arg 


Leu 


Lys 


Asn 


Ser 


Asp 


Arg 


Cys 


Arg 


Pro 


Pro 


Asp 


Pro 


Val 


He 


Lys 


Gly 


He 


Lys 


Tyr 


Ser 


Leu 


He 


Gly 


Ser 


He 


Ser 


Gly 


Asp 


Glu 


Thr 


Pro 


He 


Gly 


Leu 


Pro 


Pro 


Phe 


lie 


Ser 


Thr 


Tyr 


Gly 


Ser 


Val 
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151 


Val 


Thr 


Tyr 


Arg 


Cys 


Asn Pro Gly Ssr 


Gly 


16 X 


Gly Arg Lys 


Val 


Phe 


Glu Leu 


Val Gly 


Glu 


171 


Pro 


Ser 


He 


Tyr Cys 


Thr Ser 


Lys Asp 


Asp 


iai 


Gin 


Val 


Gly 


Ha 


Trp 


Ser Gly 


Pro Ala 


Pro 


191 


Gin 


Cys 


He 


He 


Pro 


Asn Lys 







SEQ ID No. 14 CM3 DNA sequence 



1 


ATGCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


ATCCAGTTCG 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCACCA 


401 


ACGGTGATTT 


CATCTCTACC 


AATCGCGAGA 


ATTTTCACTA 


TGGTTCTGTG 


4S1 


GTGACCTACC 


GCTGCAATCC 


GGGTAGCGGT 


GGTCGTAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


CAAAGACGAT 


CAAGTGGGCA 


551 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAACAA 


A 



SEQ XV No. 15 CMS amino acid secTience 



1 


Met 


Gin 


Cys 


Asn 


Ala 


Pro 


Glu 


Trp 


Leu 


Pro 


11 


Phe 


Ala 


Arg 


Pro 


Thr 


Asn 


Leu 


Thr 


Asp 


Glu 


21 


Phe 


Glu 


Phe 


Pro 


He 


Gly 


Thr 


Tyr 


Leu 


Asn 


31 


Tyr 


Glu 


Cys 


Arg 


Pro 


Gly 


Tyr 


Ser 


Gly 


Arg 


41 


Pro 


Phe 


Ser 


He 


He 


Cys 


Leu 


Lys 


Asn 


Ser 


51 


Val 


Trp 


Thr 


Gly 


Ala 


Lys 


Asp 


Arg 


Cys 


Arg 


SI 


Arg 


Lys 


Ser 


Cys 


Arg 


Asn 


Pro 


Pro 


Asp 


Pro 


71 


Val 


Asn 


Gly 


Met 


Val 


His 


Val 


He 


Lys 


Gly 


81 


He 


Gin 


Phe 


Gly 


Ser 


Gin 


He 


Lys 


Tyr 


Ser 


91 


Cys 


Thr 


Lys 


Gly 


Tyr 


Arg 


Leu 


He 


Gly 


Ser 


101 


Ser 


Ser 


Ala 


Thr 


Cys 


He 


He 


Ser 


Gly 


Asp 


111 


Thr 


Val 


He 


Trp 


Asp 


Asn 


Glu 


Thr 


Pro 


He 


121 


Cys 


Asp 


Arg 


He 


Pro 


Cys 


Gly 


Leu 


Pro 


Pro 


131 


Thr 


He 


Ala 


Asn 


Gly 


Asp 


Phe 


Thr 


Ser 


He 


141 


Ser 


Arg 


Glu 


Tyr 


Phe 


His 


Tyr 


Gly 


Ser 


Val 
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151 Val Thr Tyr Arg 

1S1 Gly Arg Lys Val 

171 Pro Ser lie Tyr 

1S1 Gin Val Gly lie 

191 Gin Cys He He 



47 

Cys Asti Pro Gly Ser Gly 
Phe Glu Leu Val Gly Glu 
Cys Thr Ser Lys Asp Asp 
Trp Ser Gly Pro Ala Pro 
Pre Asn Lys 



SSQ 1X3 No, IS CMS DNA sequence 





ATGCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGG TTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


151 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


ATCCAGTTCG 


251 


GTTCC CAAAT 


TAAATATTC7 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCGCCA 


401 


ACGGTGATTT 


CACCTCTATC 


AGTCGCGAGT 


ATTTTCACTA 


TGGTTCTGTG 


451 


GTGACCTACC 


GCTGCAA7CC 


GGGTAGCGGT 


GGTCGTAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCA7CT 


AC7GCACTAG 


CAAAGACGAT 


CAAGTGGGCA 


S51 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAACAA 


A 



SEQ ID No. 17 CMS amino acid sequence 



1 


Met 


Gin 


Cys 


Asn 


Ala 


Pro 


Glu 


Trp 


Leu 


Pro 


11 


Phe 


Ala 


Arg 


Pro 


Thr 


Asn 


Leu 


Thr 


Asp 


Glu 


21 


Phe 


Glu 


Phe 


Pro 


He 


Gly 


Thr 


Tyr 


Leu 


Asn 


31 


Tyr 


Glu 


Cys 


Arg 


Pro 


Gly 


Tyr 


Ser 


Gly 


Arg 


41 


Pro 


Phe 


Ser 


He 


He 


Cys 


Leu 


Lys 


Asn 


Ser 


51 


Vai 


Trp 


Thr 


Gly 


Ala 


Lys 


Asp 


Arg 


Cys 


Arg 


SI 


Arg 


Lys 


Ser 


Cys 


Arg 


Asn 


Pro 


Pro 


Asp 


Pro 


71 


Val 


Asn 


Gly 


Met 


Val 


His 


Val 


He 


Lys 


Gly 


81 


lie 


Gin 


Phe 


Gly 


Ser 


Gin 


He 


Lys 


Tyr 


Ser 


91 


Cys 


Thr 


Lys 


Gly 


Tvr 


Arg 


Leu 


Xle 


Gly 


Ser 


101 


Ser 


Ser 


Ala 


Thr 


Cys 


He 


He 


Ser 


Gly 


Asp 


Hi 


Thr 


Val 


He 


Trp 


Asp 


Asn 


Glu 


Thr 


Pro 


He 


121 


Cys 


Asp 


Arg 


He 


Pro 


Cys 


Gly 


Leu 


Pro 


Pro 


131 


Thr 


Xle 


Thr 


Asn 


Gly 


Asp 


Phe 


He 


Ser 


Thr 


141 


Asn 


Arg 


Glu 


Asn 


Phe 


His 


Tyr Gly 


Ser 


Val 
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Vsl T hr Tvr His 


Cvs 


Asn Leu Gly Ser 


Arg 


161 


Gly Lys Lys Val 


Phe 


Glu Leu Val Gly Glu 


171 


Pro Ser lis Tyr 


Cys 


Thr Ser Lys Asp 


Asp 


131 


Gin Val Gly He 


Trp 


Ser Gly Pro Ala 


Pro 


191 


Gin Cys lie He 


Pro 


Asn Lys 





SHQ ID No. 18 CMS DNA sequence 



1 


ATGCAGTGCA 


ACGCTCCGGA 


ATGGCTGCCG 


TTCGCGCGCC 


CGACCAACCT 


51 


GACTGATGAA 


TTTGAGTTCC 


CGATCGGTAC 


CTACCTGAAC 


TACGAATGCC 


101 


GCCCGGGTTA 


TAGCGGCCGC 


CCGTTTTCTA 


TCATCTGCCT 


GAAAAACTCT 


1S1 


GTCTGGACTG 


GTGCTAAGGA 


CCGTTGCCGA 


CGTAAATCTT 


GTCGTAATCC 


201 


GCCAGATCCG 


GTTAACGGCA 


TGGTGCATGT 


GATCAAAGGC 


AT CCAGTTCG 


251 


GTTCCCAAAT 


TAAATATTCT 


TGTACTAAAG 


GTTACCGTCT 


GATTGGTTCC 


301 


TCCAGCGCTA 


CATGCATCAT 


CTCTGGTGAT 


ACTGTCATTT 


GGGATAATGA 


351 


AACACCGATT 


TGTGACCGAA 


TTCCGTGTGG 


TCTGCCGCCG 


ACCATCACCA 


401 


ACGGTGATTT 


CATCTCTACC 


AATCGCGAGA 


ATTTTCACTA 


TGGTTCTGTG 


451 


GTGACCTACC 


ACTGCAATCT 


GGGTAGCCGT 


GGTAAAAAGG 


TGTTTGAGCT 


501 


CGTGGGTGAG 


CCGTCCATCT 


ACTGCACTAG 


CAAAGACGAT 


CAAGTGGGCA 


551 


TCTGGAGCGG 


CCCGGCACCG 


CAGTGCATCA 


TCCCGAACAA 


A 



SEQ ID No. 19 CM8 amino acid sequence 

I Met Arg He Pro Cys Gly Leu Pro Pro Thr 

II He Ala Asn Gly Asp Phe Thr Ser He Ser 
21 Arg Glu Tyr Phe His Tyr Gly Ser Val Val 
31 Thr Tyr Arg Cys Asn Pro Gly Ser Gly Gly 
41 Arg Lys Val Phe Glu Leu Val Gly Glu Pro 
Si Ser He Tyr Cys Thr Ser Asn Asp Asp Gin 
SI Val Gly He Trp Ser Gly Pro Ala Pro Gin 
71 Cys He He Pro Asn Lys 
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SEQ ID NO. 20 CM3 DNA sequence 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCGCCAACG GTGATTTCAC 

SI CTCTATCAGT CGCGAGTATT TTCACTATGG TTCTGTGGTG ACCTACCGCT 

101 GCAATCCGGG TAGCGGTGGT CGTAAGGTGT TTGAGCTCGT GGGTGAGCCG 

151 TCCATCTACT GCACTAGTAA TGACGATCAA GTGGG CATCT GGAGCGGCCC 

201 GGCACCGCAG TGCATCATCC CGAACAAA 

SEQ ID No. 21 CM9 amino acid sequence 





Met 


Arg 


lie 


Pro 


Cys 


Gly 


Leu 


Pro 


Pre 


Thr 


11 


lie 


Thr Asn Gly As? 


Phe 


lie 


Ser 


Thr 


Asn 


21 


Arg 


Glu 


Asn 


Phe 


His 


Tyr 


Gly Ser 


Vai 


Vai 


31 


Thr 


Tyr 


His 


Cys 


Asn 


Leu 


Gly 


Ser 


Arg 


Gly 


41 


Lys 


Lys 


Val 


Phe 


Glu 


Leu 


Vai 


Gly Glu 


Pro 


51 


Ser 


He 


Tyr 


Cys 


Thr 


Ser 


Asn 


Asp 


Asp 


Gin 


€1 


Val 


Gly 


He 


Trp 


Ser 


Gly 


Pro 


Ala 


Pro 


Gin 


71 


Cys 


He 


He 


Pro 


Asn 


Lys 











SEQ ID Ho. 2 2 CHS DNA sequence 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCACCAACG GTGATTTCAT 
51 CTCTACCAAT CGCGAGAATT TTCACTATGG TTCTGTGGTG ACCTACCACT 
101 GCAATCTGGG TAGCCGTGGT AAAAAGGTGT TTGAGCTCGT GGGTGAGCCG 
151 TCCATCTACT GCACTAGTAA TGACGATCAA GTGGGCATCT GGAGCGGCCC 
201 GGCACCGCAG TGCATCATCC CGAACAAA 

SEQ ID No. 23 OHO amino acid sequence 



1 


Met Arg lie Pro Cys Gly Leu Pro Pro 


Thr 


11 


He Thr Asn Gly Asp 


Phe lie Ser Thr 


Asn 


21 


Arg Glu Asn Phe His 


Tyr Gly Ser Val 


Val 


31 


Thr Tyr Arg Cys Asn 


Pro Gly Ser Gly Gly 


41 


Arg Lys Val Phe Glu 


Leu Val Gly Glu 


Pro 


51 


Ser He Tyr Cys Thr 


Ser Lys Asp Asp 


Gin 


51 


Val Gly He Trp Ser 


Gly Pro Ala Pro 


Gin 


71 


Cys He He Pro Asn 


Lys 





WO 98/39433 



50 



PCT/GB98/00727 



SEQ ID No. 24 CM10 DNA sequence 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCACCAACG GTGATTTCAT 

51 CTCTACCAAT CGCGAGAATT TTCACTATGG TTCTGTGGTG ACCTACCGCT 

101 GCAATCCGGG TAGCGGTGGT CGTAAGGTGT TTGAGCTCGT GGGTGAGCCG 

1S1 TCCATCTACT GCACTAGCAA AGACGATCAA GTGGGCATCT GGAGCGGCCC 

201 GGCACCGCAG TGCATCATCC CGAACAAA 

SSQ ID No. 25 CM12 amino acid sequence 



1 


Met 


Arg 


lie 


Pro 


Cys 


Gly 


Leu 


Pro 


Pro 


Thr 


11 


lis 


Ala 


Asn Gly 


Asp 


Phe 


Thr 


Ser 


He 


Ser 


21 


Arg 


Glu 


Tyr 


Phe 


His 


Tyr 


Gly 


Ser 


Val 


Val 


31 


Thr 


Tyr 


Arg 


Cys 


Asn 


Pro 


Gly 


Ser 


Gly 


Gly 


41 


Arg 


Lys 


Val 


Phe 


Glu 


Leu 


Val 


Gly Glu 


Pro 


51 


Ser 


lie 


Tyr 


Cys 


Thr 


Ser 


Lys 


Asp 


Asp 


Gin 


61 


Val 


Gly 


He 


Trp 


Ser 


Gly 


Pro 


Ala 


Pro 


Gin 


71 


Cys 


lie 


He 


Pro 


Asn 


Lys 











SEQ ID No. 26 CM12 DNA sequence 

I ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCGCCAACG GTGATTTCAC 
51 CTCTATCAGT CGCGAGTATT TTCACTATGG TTCTGTGGTG ACCTACCGCT 

101 GCAATCCGGG TAGCGGTGGT CGTAAGGTGT TTGAGCTCGT GGGTGAGCCG 
15! TCCATCTACT GCACTAGCAA AGACGATCAA^ GTGGGCATCT GGAGCGGCCC 
201 GGCACCGCAG TGCATCATCC CGAACAAA 

SEQ ID No. 27 CM13 amino acid se<ruence 

Met Arg He Pro Cys Gly Leu Pro Pro Thr 

II He Thr Asn Gly As? Phe lie Ser Thr Asn 
21 Arg Glu Asn She His Tyr Gly Ser Val Val 
31 Thr Tyr His Cys Asn Leu Gly Ser Arg Gly 
41 Lys Lys Val Phe Glu Leu Val Gly Glu Pro 
51 Ser He Tyr Cys Thr Ser Lys Asp As? Gin. 
SI Val Gly Ha Tr? Ser Gly Pro Ala Pro Gin 
71 Cys He He Pro Asn Lys 
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SEQ ID No. 2 8 CM13 DNA sequence 

X ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCACCAACG GTGATTTCAT 
51 CTCTACCAAT CGCGAGAATT TTCACTATGG TTCTGTGGTG ACCTACCACT 
101 GCAATCTGGG TAGCCGTGGT AAAAAGGTGT TTGAGCTCGT GGGTGAGCCG 
151 T C CATCT ACT GCACTAGCAA AGACGATCAA GTGGGCATCT GGAGCGGCCC 
201 GGCACCGCAG TGCATCATCC CGAACAAA 

SEQ ID No. 2 9 CM14 amino acid sequence 



1 


Met 


Arg 


lie 


Pro 


Cys 


Gly 


Leu 


Pro 


Pro 


Thr 


11 


lie 


Ala 


Asn Gly Asp 


Phe 


Thr 


Ser 


lie 


Ser 


21 


Arg Giu Tyr 


Phe 


His 


Tyr 


Gly 


Ser 


Val 


Val 


31 


Thr 


Tyr 


His 


Cys 


Asn 


Leu 


Gly 


Ser 


Arg 


Gly 


41 


Lys 


Lys 


Val 


Phe 


Giu 


Leu 


Val 


Gly Giu 


Pro 


51 


Ser 


lie 


Tvr 


Cys 


Thr 


Ser 


Lys 


Asp 


Asp 


Gin 


SI 


Val 


Gly 


He 




Ser 


Gly 


Pro 


Ala 


Pro 


Gin 


71 


Cys 


lie 


lie 


Pro 


Asn 


Lys 











SEQ XD No. 3 0 Oil 4 DNA sequence 

1 ATGCGAATTC CGTGTGGTCT GCCGCCGACC ATCGCCAACG GTGATTTCAC 
51 CTCTATCAGT CGCGAGTATT TTCACTATGG TTCTGTGGTG ACCTACCACT 
101 GCAATCTGGG TAGCCGTGGT AAAAAGGTGT TTGAGCTCGT GGGTGAGCCG 
151 TC CATCT ACT GCACTAGCAA AGACGATCAA GTGGGCATCT GGAGCGGCCC 
201 GGCACCGCAG TGCATCATCC CGAACAAA 

SEQ XD No. 31 CM7/cys amino acid sequence 



1 


Met Gin 


Cys 


Asn Ala 


Pro 


Giu 


Trp 


Leu Pro 


11 


Phe Ala 


Arg 


Pro Thr 


Asn 


Leu 


Thr 


Asp Giu 


21 


Phe Giu 


Phe 


Pro lie 


Gly Thr 


Tyr 


Leu Asn 


31 


Tyr Giu 


Cys 


Arg Pro 


Gly Tyr Ser 


Gly Arg 


41 


Pro Phe 


Ser 


He lie 


Cys 


Leu Lys 


Asn Ser 


51 


Val Trp 


Thr 


Gly Ala 


Lys 


Asp 


Arg 


Cys Arg 


61 


Arg Lys 


Ser 


Cys Arg 


Asn 


Pro 


Pro 


Asp Pro 


71 


Val Asn Gly Met Val 


His 


Val 


He 


Lys Gly 


ai 


lie Gin 


Phe 


Gly Ser 


Gin 


lie 


Lys 


Tyr Ser 
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91 cys Thr Lys Gly Tyr Arg Leu He Gly Ser 

101 Ser Ser Ala Thr Cys He He Ser Gly Asp 

111 Thr Val lie Trp Asp Asn Glu Thr Pro He 

121 Cys Asp Arg He Pro Cys Gly Leu Pro Pro 

131 Thr lie Ala Asn Gly As? Phe Thr Ser He 

14! Ser Arg Glu Tyr Phe Eis Tyr Gly Ser Val 

151 val Thr Tyr His Cys Asn Leu Gly Ser Arg 

161 Gly Lys Lys Val Phe Glu Leu Val Gly Glu 

171 Pro Ser He Tyr Cys Thr Ser Lys Asp Asp 

131 Gin Val Gly He Trp Ser Gly Pro Ala Pro 

121 Gin Cys He Ha Pro Asn Lys Cys 

SSQ ID No. 32 DKA seOTer.cs 

1 CTGGAGCGGG CCCGCACCGC AGTGCATCAT CCCGAACAAA TGC7AATARA AGC 
SZQ ID No. 33 DNA sequence 

1 GCTTTTATTA GCATTTGTTC GGGATGATGC ACTGCGGTGC GGGCCCGCTC CAG 
SEQ ID No. 34 OiT/Cys-S-S- [MSWP-1I amino acid sequence 



Met Gin Cys Asn Ala Pro Glu Trp Leu Pro 
Phe Ala Arg Pro Thr Asn Leu Thr Asp Glu 
Phe Glu Phe Pro He Gly Thr Tyr Leu Asn 
31 Tyr Glu Cys Arg Pro Gly Tyr Ser Gly Arg 
Pro Phe Ser He He Cys Leu Lys Asn Ser 



l 

11 
-> i 



41 



51 Val Trp Thr Gly Ala Lys Asp Arg Cys Arg 

SI Arg Lys Ser Cys Arg Asn Pro Pro Asp Pro 

71 val Asn Gly Met Val Eis Val He Lys Gly 

81 He Gin Phe Gly Ser Gin He Lys Tyr Ser 

91 cys Thr Lys Gly Tyr Arg Leu He Gly Ser 

101 Ser Ser Ala Thr Cys He He Ser Gly As? 

Ill Thr Val He Trp Asp Asn Glu Thr Pro He 

121 Cys Asp Arg He Pro Cys Gly Leu Pro Pro 

131 Thr He Ala Asn Gly Asp Phe Thr Ser He 

141 Ser Arg Glu Tyr Phe Sis Tyr Gly Ser Val 

151 Val Thr Tyr His Cys Asn Leu Gly Ser Arg 
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161 * Gly Lys Lys Vai Phe Glu Leu Val Gly Glu 
171 Pro Ser lie Tyr Cys Thr Ser Lys Asp Asp 
131 Gin Val Gly Xie Trp Ser Gly Pro Ala Pro 
131 Gin Cys lie lie Pro Asn Lys Cys-S-S-Cys 

I I 
C0 2 H CONK : 

Asp Gly Pro Lys Lys Lys Lys Lys Lys Ser 
Pro Ser Lys Ser Ser Gly (N-Myristoyi) 

SEQ ID No. 35: Amino acid sequence of peptide used in MSWP synthesis 

I Gly Ser Ser Lys Ser Pro Ser Lys Lys Lys 

II Lys Lys Lys Pro Gly Asp Cys NH 2 

SSQ ID No. 3 6 CM15/cys amino acid sequence 

I Met Gin Cys Asn Val Pro Gin Trp Leu Pro 

II Phe Ala Arg Pro Thr Asn Leu Thr Asp Asp 
21 Phe Glu Phe Pro lie Gly Thr Tyr Leu Asn 
31 Tyr Glu Cys Arg Pro Gly Tyr Ser Gly Arg 
41 Pro Phe Ser lie lis Cys Leu Lys Asn Ser 
51 Val Trp Thr Ser Ala Lys Asp Lys Cys Lys 
61 Arg Lys Ser Cys Arg Asn Pro Pro Asp Pro 
71 Val Asn Gly Met Ala His Val lie Lys Asp 
31 lie Gin Phe Arg Ser Gin He Lys Tyr Ser 
31 Cys Pro Lys Gly Tyr Arg Leu He Gly Ser 
1Q1 Ser Ser Ala Thr Cys lie lie Ser Gly Asn 
ill Thr Val lie Trp Asp Asn Lys Thr Pro Val 
121 Cys Asp Arg He lie Cys Gly Leu Pro Pro 
131 Thr Lie Ala' Asn Gly Asp Phe Thr Ser He 
141 Ser Arg Giu Tyr Phe His Tyr Gly Ser Vai 
151 Val Thr Tyr His Cys Asn Leu Gly Ser Arg 
161 Gly Lys Lys Val Phe Giu Leu Val Gly Glu 
171 Pro Ser He Tyr Cys Thr Ser Lys Asp Asp 
18 1 Gin Vai Gly He Trp Ser Gly Pro Ala Pro 
191 Gin Cys He lis Pro Asn Lys Cys 
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SEQ ID No. 37 DNA sequence 

1 CAGTGCAACG TGCCGGAA7G G 

SEQ ID No. 33 DNA sequence 

1 CCATTCCGGA ACGXTGCACT G 

SEQ ID No. 39 DNA sequence 

1 GACTGATGAT TTTGAGTTCC 

SEQ ID No. 4Q DNA sequence 

1 GGAACTCAAA ATCATCAGTC 

SEQ ID No. 41 DNA sequence 

1 GTCTGGACTA GTGC7.-AGGA C.-AGTGCAAA CGT.-AATCTT GTCG 

SEQ ID No. 42 DNA sequence 

1 CGACAAGATT TACGTTTGCA CTTGTCCTTA GCACTAGTCC AGAC 



SEQ ID No. 43 DNA sequence 

1 CGGCATGGCG CATGTGATCA AAGATATCCA GTTCCGATCG CAAATTAAAT 

51 ATTCTTGTCC TAAGGGTTAC CGTC 

SEQ ID No . 44 DNA sequence 

1 GACGGTAACC CTTAGGACAA. GAATATTTAA TTTGCGATCG GAACTGGATA 

51 TCTTTGATCA CATGCGCCAT GCCG 

SEQ ID No. 45 DNA sequence 

1 CATCTCTGGT AATACTGTCA TTTGGGAXAA TAAAACACCG GTTTGTGACC 

SEQ ID No. 4 6 DNA sequence 

I GGTCACAAAC CGGTGTTTTA TTATCCCAAA TGACAGTATT ACCAGAGATG 

SEQ ID No. 47 DNA sequence 

I GACCGAATTA TCTGTGGTCT G 

SEQ ID No . 48 DNA sequence 

1 CAGACCACAG ATAATTCGG7 C 



SEQ ID No. 43 CMX5/cys-MSWPl asMno acid sequence 
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I Met Gin Cys Asn Val Pro Glut Trp Leu Pro 

II Phe Ala Arg Pro Thr Asn Leu Thr Asp Asp 
21 Phe Glu Phe Pro lie Giy Thr Tyr Leu Asn 
31 Tyr Glu Cys Arg Pro Gly Tyr Ser Giy Arg 
41 Pro Phe Ser lie lie Cys Leu Lys Asn Ser 
51 Val Trp Thr Ser Ala Lys Asp Lys Cys Lys 
61 Arg Lys Ser Cys Arg Asn Pro Pro Asp Pro 
71 Val Asn Gly Met Ala His Val He Lys Asp 
81 He Gin Phe Arg Ser Gin lie Lys Tyr Ser 
91 Cys Pro Lys Gly Tyr Arg Leu He Gly Ser 
101 Ser Ser Ala Thr Cys lie He Ser Gly Asn 

III Thr Val lie Trp Asp Asn Lys Thr Pro Vai 
121 Cys Asp Arg He lie Cys Gly Leu Pro Pro 
131 Thr He Ala Asn Giy As? Phe Thr Ser He 
141 Ser Arg Glu Tyr Phe His Tyr Gly Ser Val 
151 Vai Thr Tyr His Cys Asn Leu Giy Ser Arg 
161 Giy Lys Lys Vai Phe Glu Leu Val Gly Glu 
171 Pro Ser lie Tyr Cys Thr Ser Lys As? Asp 
181 Gin Val Giy He Trp Ser Giy Pro Ala Pro 
191 Gin Cys He He Pro Asn Lys Cys - S - S - Cys 

i ' i 
CG 2 H CONK 

Asp Gly Pro Lys Lys Lys Lys Lys Lys Ser 
Pro Ser Lys Ser Ser Giy {K-MyriscoyU 



SEQ ID No. 50 QHS/cys amino acid sequence 



1 


Met 


Gin Cys 


Asn 


Val 


Pro 


Giu 


Trp 


Leu 


Pro 


11 


Phe 


Aia 


Arg 


Pro 


Thr 


Asn 


Leu 


Thr 


Asp 


Asp 


21 


Phe 


Glu 


Phe 


Pro 


He Gly Thr 


Tyr 


Lau 


Asn 


31 


Tyr 


Glu 


Cys 


Arg 


Pro 


Giy 


Tyr 


Ser 


Giy 


Arg 


41 


Pro 


Phe 


Ser 


He 


He 


Cys 


Leu 


Lys 


Asn 


Ser 


51 


Vai 


Trp 


Thr 


Ser 


Aia 


Lys 


Asp 


Lys 


Cys 


Lys 


61 


Arg 


Lys 


Ser 


Cys 


Arg Asn 


Pro 


Pro 


Asp 


Pro 


71 


Val 


Asn Gly "Met: 


Ala 


His 


Val 


He 


Lys 


Asp 


81 


He 


Gin 


Phe 


Arg 


Ser 


Gin 


He 


Lys 


Tyr 


Ser 


91 


Cys 


Pro 


Lys 


Gly 


Tyr 


Arg 


Leu 


He 


Giy 


Ser 
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101 Ser Ser Ala Thr Cys lie lie Ser Gly Asn 
111 Thr Val He Trp Asp Asa Lys Thr Pro Val 
121 Cys Asp Arg He Pro Cys Gly Leu Pro Pro 
131 Thr He Ala Asn Gly Asp Phs Thr Ser He 
141 Ser Arg Glu Tyr Phe His Tyr Gly Ser Val 
151 Val Thr Tyr His Cys Asn Leu Gly Ser Arg 
161 Gly Lys Lys Val Phe Glu Leu Val Gly Glu 
171 Pro Ser He Tyr Cys Thr Ser Lys Asp Asp 
131 Gin Val Gly lie Trp Ser Gly Pro Ala Pro 
191 Gin Cys lie lie Pro Asa Lys Cys 

SEQ ID Ko. 51 OH 6 / cys -MSWP 1 amino acid sequence 

i Met Gin Cys Asa Val Pro Glu Trp Leu Pro 

II Phe Ala Arg Pro Thr Asa Leu Thr Asp Asp 
21 Phe Glu Phe Pro lie Gly Thr Tyr Leu Asa 
31 Tyr Giu Cys Arg Pro Gly Tyr Ser Gly Arg 
41 Pro Phe Ser lie He Cys Leu Lys Asa Ser 
51 Val Trp Thr Ser Ala Lys Asp Lys Cys Lys 
61 Arg Lys Ser Cys Arc Asa Pro Pro Asp Pro 
71 Val Asn Gly Met Ala His Val He Lys Asp 
81 He Gin Phe Arg Ser Gin lie Lys Tyr Ser 
91 Cys Pro Lys Gly Tyr Arg Leu He Gly Ser 
101 Ser Ser Ala Thr Cys He He Ser Gly Asn 

III Thr Val He Trp Asp Asn Lys Thr Pro Val 
121 Cys Asp Arg He Pro Cys Gly Leu Pro Pro 
131 Thr lie Ala Asa Gly Asp Phe Thr Ser He 
141 Ser Arg Glu Tyr Phe His Tyr Gly Ser Val 
151 Val Thr Tyr His Cys Asn Leu Gly Ser Arg 
151 Gly Lys Lys Val Phe Glu Leu Val Gly Glu 
171 ?ro Ser He Tyr Cys Thr Ser Lys Asp Asp 
131 Gin Val Gly He Trp Ser Gly Pro 'Ala Pro 
191 Gin Cys He lie Pro Asa Lys Cys-S-S-Cys 

I I 

C0 2 H CONK, 
Asp Gly Pro Lys Lys Lys Lys Lys Lys Ser 
Pro Ser Lys Ser Ser Gly (N-Myristoyl) 



Seq ID No. 52 DNA sequence 
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1 CGCACCGCAG TGCATCATCC CGAACAAAGA TGGCCCGAGC GAAATTCTGC 

51 GTGGCGATTT TAGCAGCTGC TA 

Seq ID No. 53 DNA sequence 

i AGCTTAGCAG CTGCTAAAAT CGCCACGCAG AATTTCGCTC GGGCCATCTT 

51 TGTTCGGGAT GATGCACTGC GGTGCGGGCC 

SEQ ID NO. 54: CM7rgdcys amino acid sequence 

I Met Gin Cys Asn Ala Pro Giu Trp Leu Pro 

II Phe Ala Arc Pro Thr Asn Leu Thr Asp Giu 
21 Phe Giu Phe Pro lie Giy Thr Tyr Leu Asn 
31 Tyr Giu Cys Arg Pro Giy Tyr Ser Giy Arg 
41 Pro Phe Ser lie lie Cys Leu Lys Asn Ser 
51 Vai Trp Thr Giy Aia Lys Asp Arg Cys Arg 
Si Arg Lys Ser Cys Arg Asn Pro Pro Asp Pro 
71 Vai Asn Giy Met Vai His Vai lie Lys Giy 
81 He Gin Phe Giy Ser Gin He Lys Tyr Ser 
51 Cys Thr Lys Giy Tyr Arg Leu He Giy Ser 
101 Ser Ser Ala Thr Cys He He Ser Giy Asp 

III Thr Vai lis Trp Asp Asn Giu Thr Pro He 
121 Cys Asp Arg He Pro Cys Giy Leu Pro Pro 
131 Thr He Aia Asn Giy Asp Phe Thr Ser He 
141 Ser Arg Giu Tyr Phe His Tyr Giy Ser Vai 
151 Vai Thr Tyr His Cys Asn Leu Giy Ser Arg 
151 Giy Lys Lys Vai Phe Giu Leu Vai Giy Giu 
171 Pro Ser He Tyr Cys Thr Ser Lys Asp Asp 
18 1 Gin Vai Giy He Trp Ser Giy Pro Ala Pro 
131 Gin Cys He He Pro Asn Lys Asp Giy Pro 
20 1 Ser Giu lie Leu Arg Giy Asp Phe Ser Ser 
211 Cys 

Seq ID No. 55 DNA sequence 

1 CGCACCGCAG TGCATCATCC CGAACAAAGC GGCGCCC^GC GTGATTGGCT 

51 TCCGTATTCT GCTGCTGAAA GTGGCGGGCT GATA 



Seq ID No. 5 6 DNA sequence 
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1 AGCTTATCAG CCCGCCACTT TCAGCAGCAG AATACGGAAG CCAATCACGC 

51 TGGGCGCCGC TTTGTTCGGG ATGATGCACT GCGGTGCGGG CC 

SSQ ID NO, 57; Q17Tcell amino acid sequence 



1 


Met 


Gin 


Cys Asn Ala Pro 


Glu Trp 


Leu 


Pro 


11 


Phe 


Ala 


Arg Pro Thr Asn 


Leu Thr 


Asp 


Glu 


21 


Phe 


Glu 


Phe Pro lis Gly 


Thr Tyr Leu 


Asn 


31 


Tyr 


Glu 


Cys Arg Pro Gly 


Tyr Ser 


Gly Arg 


41 


Pro 


Phe 


Ser lie lis Cys 


Leu Lys 


Asn 


Ser 


51 


Val 


Trp 


Thr Gly Ala Lys 


Asp Arg 


Cys 


Arg 


SI 


Arg 


Lys 


Ser Cys Arg Asn 


Pro Pro 


Asp 


Pro 


71 


Val 


Asn 


Gly Met Val His 


Val He 


Lys 


Gly 


81 


lie 


Gin 


Phe Gly Ser Gin 


He Lys 


Tyr 


Ser 


91 


Cys 


Thr 


Lys Gly Tyr Arg 


Leu He 


Gly 


Ser 


101 


Ser 


Ser 


Ala Thr Cys He 


lie Ser 


Gly 


Asp 


111 


Thr 


Val 


He Trp Asp Asn 


Glu Thr 


Pro 


He 


121 


Cys 


Asp 


Arg He Pro Cys 


Gly Leu 


Pro 


Pro 


13 1 


Thr 


lie 


Ala Asn Gly Asp 


Phe Thr 


Ser 


He 


141 


Ser 


Arg 


Glu Tyr Phe His 


Tyr Gly 


Ser 


Val 


151 


Val 


Thr 


Tyr His Cys Asn Leu Gly Ser Arg 


lffl 


Gly Lys 


Lys Val Phe Glu 


Leu Val 


Gly Glu 


171 


Pro 


Ser 


lie Tyr Cys Thr 


Ser Lys 


Asp 


Asp 


131 


Gin 


Val 


Gly He Trp Ser 


Gly Pro Ala 


Pro 


131 


Gin 


Cys 


He He Pro Asn 


Lys Aia 


Ala 


Pro 


2Q1 


Ser 


Val 


He Gly Phe Arg 


Ha Leu 


Leu 


Leu 


211 


Lys 


Val 


Ala Gly 
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Claims 

1 . A soluble polypeptide comprising, in sequence, one to four short consensus 
repeats (SCR) selected from SCR 1, 2, 3 and 4 of long homologous repeat A (LHR-A) as 
the only structurally and functionally intact SCR domains of CR1 and including at least 
SCR3, in which one or more of the native amino acids are substituted with the following 
Val 4, Asp 19, Ser 53, Lys 57, Ala 74, Asp 79, Arg 84, Pro 91, Asn 109, Lys 1 16, Val 

1 19, Ala 132, Thr 137, He 139, Ser 140, Tyr 143, His 153, Leu 156, Arg 159, Lys 161, 
Lys 177, Gly 230, Ser 235, His 236. 

(Numbering is from glutamine as residue i of mature CR1 . The amino-acid indicated is 
that which replaces the CR1 residue at the position specified.) 

2. A polypeptide according to claim 1 which comprises, in sequence, SCR 1 , 2, 3 
and 4 of LHR-A or SCR 1, 2 and 3 of LHR-A as the only structurally and functionally 
intact SCR domains of CRL 

3. A polypeptide according to claim 1 or 2 comprising the native interdomain 
sequences in CR1 optionally substituted with the corresponding predicted aminoacids in 
the CRl -like sequence, namely Lys59 and/or He 124. (Numbering is from glutamine as 
residue 1 of mature CR1 . The amino-acid indicated is that which replaces the CR1 
residue at the position specified.) 

4. A polypeptide according to claim 2 or 3 of formula (I): 

NH 2 -V 1 -SCR1 -W I -SCR2-X 1 -SCR3-Y 1 -OH (I) 
in which SCR1 represents residues 2-58 of mature CRl, SCR2 represents residues 63-120 
of mature CRl, SCR3 represents residues 125-191 of mature CRl, and containing at ieast 
one of the substitutions as aforesaid and V*, W*, X* and represent bonds or short 
linking sequences of amino acids, preferably 1 to 5 residues in length and which are 
preferably derived from native interdomain sequences in CRl, optionally substituted as 
defined in claim 3. 

5. A polypeptide according to claim 4 wherein , X* and represent residues 
59-62, 121-124 and 192-196, respectively, of mature CRl, optionally substituted as 
defined in claim 3, and V* represents residue 1 of mature CRl optionally linked via its 
N-terminus to methionine. 

6. A polypeptide according to claim 2 or 3 of formula (II): 

NH2-V2-SCRl-W2-SCR2-X 2 -SCR3-Y 2 -SCR4-Z 2 OH (II) 
in which SCR1, SCR2 and SCR3 are as hereinbefore defined, SCR4 represents residues 
197-252 of mature CRl and containing at least one of the substitutions as aforesaid, and 
V 2 , W 2 , X 2 , Y 2 and Z 2 represent bonds or short linking sequences of amino acids, 
preferably 1 to 5 residues in length and which are preferably derived from native 
interdomain sequences in CR1 T optionally substituted as defined in claim 3. 
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7. A polypeptide according to claim 6 wherein W 2 , X 2 Y 2 and Z 2 represent 

residues 59-62, 121-124, 192-196, and residues 253 respectively, of mature CRl, 

optionally substituted as defined in claim 3> and V 2 represents residue 1 of mature CRl 

optionally linked via its N-tenninus to methionine. 

3. A polypeptide according to claim I or 3 of formula (III): 

NH 2 -X 3 -SCR3-Y 3 -QH " (HI) 

in which SCR3 is as hereinbefore. defined, containing at least one of the substitutions as 

aforesaid, and in a preferred embodiment, all those of Sequence Group 1 aforesaid, and in 

a preferred embodiment, ail those of Sequence Group 1, namely: A132, T137 t 1139, 

3140, Y143, H153, L156 ; R159, K161, K177, and X 3 and Y\ and X 3 and Y 3 represent 
bonds or short linking sequences of amino acids, preferably 1 to 5 residues in length and 
which are preferably derived from native interdomain sequences ir CRl, optionally 
substituted as defined in claim 3. 

9. A polypeptide according to claim 8 wherein X 3 represents amino acids 122-124 
of mature CRl, optionally substituted as denned in claim 3. optionally linked to 
methionine at its N-terminus and Y~ represents amino acids 192-196 of mature CRl . 

10. A polypeptide according to claim 1 or 3 of formula (IV): 

>?H2-X 4 -SCR3-Y 4 -SCR4-Z^OH (IV) 
in which SCR3 and SCR^- are as hereinbefore defined containing at least one of the 
substitutions as aforesaid and X", Y 4 and Z 4 represent bonds or short linking sequences 
of amino acics 7 preferably 1 to 5 residues in length and which are preferably derived from 
native interdomarn sequences in CRl f optionally substituted as defined in claim 3. 

11. A polypeptide according to claim 1 0 wherein X 4 represents amino acids 122424 
of mature CRl, optionally substituted as defined in claim 3, optionally linked to 
methionine at its N-terminus and Y~ and Z 4 represent ammo acids 192-196 and 253 
respectively of mature CR 1 . \ 

12. A polypeptide according to any preceding claim wherein the SCR3 domain is 
substituted with all ten residues found in the corresponding pseudogene sequence, namely 
(in single letter code): 

A132, T137, 1139, S140 : Y143, H153, L156, R159, K16i, K177 (Sequence Group 1) 
and the remaining domains have the sequence of mature CRl. 

13. A polypeptide according to claim 1 selected from SEQ ED NOs: 1, 9, 1 1, 13, 15, 
17,, 19, 21, 23, 25, 27 and 29. 

14. A soluble derivative of the soluble polypeptide of any preceding claim, said 
derivative comprising two or more heterologous membrane binding elements with low 
membrane affinity covalently associated with the polypeptide which elements are capable 



AWEW60- SHEET 



WORD:39l61PCTJO 

61 



of interacting independently and with thermodynamic additivity with components of 
cellular membranes exposed to extracellular fluids. 

15. A derivative according to ciaim 14 compring two to eight membrane binding 
elements selected from: fatty acid derivatives; iigands of known integral membrane 
proteins; sequences derived from the complementarity-determining region of monoclonal 
antibodies raised against epitopes of membrane proteins; membrane binding sequences 
identified through screening of random chemical libraries. 

16. A derivaiive according to claim 1 4 or 15 having the following structure: 
PPI-{L-rW]Jn-X 

in which; 

P is the soluble polypeptide, 

each L is independently a flexible linker group. 

each W is independently a vcptidic membrane binding element, 

n is an integer of 1 or more and 

X is a peptidic or non-peptidic membrane-binding entity which may be covalentiy 
linked to any W. 

17. A polypeptide derivative which is SEQ ID NO: 34,49 or 51 

1 S . The polypeptide portion of a derivative according to any of claims 14 to 1 7, 
wherein the polypeptide portion is SEQ ED NO: 31, 36, 50, 54 or 57. 

19. A process for preparing a polypeptide according to any of claims 1 to :3 which : 
process comprises expressing DNA encoding said polypeptide in a recombinant host cell 
and recovering the product 

20. A DNA polymer comprising a nucleotide sequence that encodes the polypeptide 
of any of claims 1 to 13, or 18. 

21 . A DNA polymer according to claim 20 selected from SEQ ID NOs; 1,10, 12, 14, 
16, 13, 20, 22, 24, 26, 28, or 30, . ; 

22. A replicable expression vector capable, in the host cell, of expressing the DNA 
polymer of claim 20 or 2 1 . 

23. A host ceil transformed with a rep lie able expression vector of claim 22. 

24. A process for preparing a derivative according to any of claims 14 to 17 which 
process comprises expressing DNA encoding the polypeptide pardon of said derivative in ^ 
a recombinant host cell and recovering the product and thereafter post translationally 
modifying the polypeptide to chemically introduce membrane binding elements. 

25. A pharmaceutical composition comprising a therapeutically effective amount of a * 
polypeptide or derivative of any of claims 1 to 17, and a pharmaceutical^ acceptable 
carrier or excipient. 
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26. A method of treating a disease or disorder associated with inflammation or 
inappropriate complement activation comprising administering to a subject in need of 
such treatment a therapeutically effective amount of a polypeptide or derivative of any of 
claims 1 to 17. 

27. The use of a polypeptide or derivative of any of claims 1 to 1 7 in the manufacture 
of a medicament fcr the treatment of a disease or disorder associated with inflammation 
or inappropriate complement activation. 
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